Abstract. Let $\Gamma$ denote the modular group $SL(2,\mathbb{Z})$ and $C_n(\Gamma)$ the number of congruence
subgroups of $\Gamma$ of index at most $n$. We prove that

$$\lim_{n\to\infty} \frac{\log C_n(\Gamma)}{(\log n)^2/\log\log n} = \frac{3-2\sqrt{2}}{4}.$$

We also present a very general conjecture giving an asymptotic estimate for $C_n(\Gamma)$ for general
arithmetic groups. The lower bound of the conjecture is proved modulo the generalized Riemann
hypothesis for Artin-Hecke L-functions, and in many cases is also proved unconditionally. The 
upper bound is proved in full in [LN]. 




§0. Introduction 

Let k be an algebraic number field, O its ring of integers, S a finite set of valuations of
k (containing all the archimedean ones), and $O_S = \{x \in k \mid v(x) \ge 0 \ \ \forall v \notin S\}$. Let G
be a semisimple, simply connected, connected algebraic group defined over k with a fixed
embedding into $GL_d$. Let $\Gamma = G(O_S) = G \cap GL_d(O_S)$ be the corresponding S-arithmetic
group. We assume that $\Gamma$ is an infinite group (equivalently, $\prod_{v \in S} G(k_v)$ is not compact).

For every non-zero ideal I of $O_S$ let

$$\Gamma(I) = \mathrm{Ker}\bigl(\Gamma \to GL_d(O_S/I)\bigr).$$

A subgroup of $\Gamma$ is called a congruence subgroup if it contains $\Gamma(I)$ for some I.

The topic of counting congruence subgroups has a long history. Classically, congruence
subgroups of the modular group were counted as a function of the genus of the associated
Riemann surface. It was conjectured by Rademacher that there are only finitely many
congruence subgroups of $SL_2(\mathbb{Z})$ of genus zero. Petersson [Pe, 1974] proved that the number
of all subgroups of index n and fixed genus goes to infinity exponentially as $n \to \infty$. Dennin
[De, 1975] proved that there are only finitely many congruence subgroups of $SL_2(\mathbb{Z})$ of
given fixed genus and solved Rademacher's conjecture. It does not seem possible, however,
to accurately count all congruence subgroups of index at most n in $SL_2(\mathbb{Z})$ by using the
theory of Riemann surfaces of fixed genus.

Following [Lu], we count congruence subgroups as a function of the index. For n > 0,
define

$$C_n(\Gamma) = \#\{\text{congruence subgroups of } \Gamma \text{ of index at most } n\}.$$

Theorem 1. There exist two positive real numbers $\alpha_-(\Gamma)$ and $\alpha_+(\Gamma)$ such that for all
sufficiently large positive integers n

$$n^{\frac{\log n}{\log\log n}\,\alpha_-} \;\le\; C_n(\Gamma) \;\le\; n^{\frac{\log n}{\log\log n}\,\alpha_+}.$$

This theorem is proved in [Lu], although the proof of the lower bound presented there 
requires the prime number theorem on arithmetic progressions in an interval where its 
validity depends on the GRH (generalized Riemann hypothesis for Dirichlet L-functions).
By a slight modification of the proof and by appealing to a theorem of Linnik [Li1, Li2] on
the least prime in an arithmetic progression, the proof can be made unconditional. Such an 
approach gives, however, poor estimates for the constants. 

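Following [Lu] we define:

$$\alpha_+(\Gamma) = \varlimsup_{n\to\infty} \frac{\log C_n(\Gamma)}{\lambda(n)}, \qquad \alpha_-(\Gamma) = \varliminf_{n\to\infty} \frac{\log C_n(\Gamma)}{\lambda(n)},$$

where $\lambda(n) = \frac{(\log n)^2}{\log\log n}$.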

It is not difficult to see that $\alpha_+$ and $\alpha_-$ are independent of both the choice of the
representation of G as a matrix group and of the choice of S. Hence $\alpha_\pm$ depend only on
G and k. The question whether $\alpha_+(\Gamma) = \alpha_-(\Gamma)$ and the challenge to evaluate them for
$\Gamma = SL_2(\mathbb{Z})$ and other groups was presented in [Lu]. Here we prove:

Theorem 2. We have $\alpha_+(SL_2(\mathbb{Z})) = \alpha_-(SL_2(\mathbb{Z})) = \frac{3-2\sqrt{2}}{4} = 0.0428932\ldots$

The proof of the lower bound in Theorem 2 is based on the Bombieri-Vinogradov Theorem
[Bo], [Da], [Vi], i.e., the Riemann hypothesis on the average. The upper bound, on the other 
hand, is proved by first reducing the problem to a counting problem for subgroups of abelian 
groups and then solving that extremal counting problem. 

In the case of a number field, we will, in fact, show a more remarkable result: the answer
is independent of O. Here, we require the GRH (generalized Riemann hypothesis) [W] for
Hecke and Artin L-functions which states that all non-trivial zeros of such L-functions lie 
on the critical line. 
Theorem 3. Let k be a number field with ring of integers O. Let S be a finite set of primes,
and $O_S$ as above. Assume GRH for k and all cyclotomic extensions $k(\zeta_\ell)$ with $\ell$ a rational
prime and $\zeta_\ell$ a primitive $\ell$-th root of unity. Then

$$\alpha_+(SL_2(O_S)) = \alpha_-(SL_2(O_S)) = \frac{3-2\sqrt{2}}{4}.$$

The GRH is needed only for establishing the lower bound. It can be dropped in many 
cases by appealing to a theorem of Murty and Murty [MM] which generalizes the Bombieri- 
Vinogradov Theorem cited earlier. 

Theorem 4. Theorem 3 holds unconditionally if the field k is contained in a Galois 
extension K such that either: 

(a) $\mathfrak{g} = \mathrm{Gal}(K/\mathbb{Q})$ has an abelian subgroup of index at most 4 (in particular, if k is an
abelian extension),

or

(b) $[K:\mathbb{Q}] < 42$.

The proof of the upper bound is very different from the proof of the lower bound. For 
a group A, we denote by $s_n(A)$ the number of subgroups of A of index at most n. A
somewhat involved reduction process is applied to show that the problem of finding the 
upper bound is actually equivalent to an extremal counting problem of subgroups of finite 
abelian groups (see section §5) which is given in Theorem 5. A sharp upper bound for that 
counting problem follows from the case R= 1 of the following theorem. 

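Theorem 5. Let $R \ge 1$ be a real number and let d be a fixed integer $\ge 1$. Suppose that
$A = C_{x_1} \times C_{x_2} \times \cdots \times C_{x_t}$ is an abelian group such that the orders $x_1, x_2, \ldots, x_t$ of its cyclic
factors do not repeat more than d times each. Suppose that $r|A|^R \le n$ for some positive
integers r and n. Then as n tends to infinity, we have

$$s_r(A) \le n^{(\gamma + o(1))\,\ell(n)}$$

where $\gamma = \frac{\bigl(\sqrt{R(R+1)} - R\bigr)^2}{4R^2}$ and $\ell(n) = \frac{\log n}{\log\log n}$.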
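As a numerical sanity check on the constant in Theorem 5, the following sketch evaluates $\gamma$ as a function of R. The function name `gamma` is ours, introduced only for illustration; the formula is the one stated in the theorem. For R = 1 it reproduces the value 0.0428932... of Theorem 2, and $16R^2\gamma$ approaches 1 from below as R grows.

```python
import math


def gamma(R: float) -> float:
    """Evaluate (sqrt(R(R+1)) - R)^2 / (4 R^2), the constant of Theorem 5."""
    s = math.sqrt(R * (R + 1.0)) - R
    return s * s / (4.0 * R * R)


if __name__ == "__main__":
    # R = 1 is the case used for SL_2.
    print(gamma(1.0), (3.0 - 2.0 * math.sqrt(2.0)) / 4.0)
    # 16 R^2 gamma(R) should increase towards 1.
    for R in (1.0, 10.0, 100.0):
        print(R, 16.0 * R * R * gamma(R))
```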

In an earlier version of this paper, Theorem 5 was proved in a similar manner, but only 
for R = 1. The more general case was proved in an early version of [LN]. We thank the 
authors of [LN] for allowing us to include the general version here. 

The above results suggest that for every Chevalley group scheme G, the upper and lower
limiting constants, $\alpha_\pm(G(O_S))$, are equal to each other, and depend only on G and not
on O. In fact, we can make a precise conjecture, for which we need to introduce some
additional notation. Let G be a Chevalley group scheme of dimension d = dim(G) and rank
$\ell = \mathrm{rk}(G)$. Let $\kappa = |\Phi_+|$ denote the number of positive roots in the root system of G, and let
$R = R(G) = \frac{d-\ell}{2\ell} = \frac{\kappa}{\ell}$. We see that if G is of type $A_\ell$ (resp. $B_\ell, C_\ell, D_\ell, G_2, F_4, E_6, E_7, E_8$)
then $R = \frac{\ell+1}{2}$ (resp. $\ell, \ell, \ell-1, 3, 6, 6, 9, 15$).
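The listed values of R can be checked directly from $R = (d-\ell)/(2\ell)$. The sketch below assumes the standard dimensions of the simple Chevalley groups (these dimensions are not stated in the text and are supplied here as an assumption); the helper names `dimension` and `R` are ours.

```python
from fractions import Fraction

# Dimensions of the exceptional types (standard values, assumed here).
EXCEPTIONAL_DIM = {("G", 2): 14, ("F", 4): 52, ("E", 6): 78, ("E", 7): 133, ("E", 8): 248}


def dimension(kind: str, rank: int) -> int:
    """Dimension of the simple Chevalley group of the given type and rank."""
    if kind == "A":
        return rank * (rank + 2)
    if kind in ("B", "C"):
        return rank * (2 * rank + 1)
    if kind == "D":
        return rank * (2 * rank - 1)
    return EXCEPTIONAL_DIM[(kind, rank)]


def R(kind: str, rank: int) -> Fraction:
    """R(G) = (dim G - rk G) / (2 rk G), as an exact fraction."""
    return Fraction(dimension(kind, rank) - rank, 2 * rank)


if __name__ == "__main__":
    for kind, rank in [("A", 1), ("A", 4), ("B", 3), ("C", 3), ("D", 4),
                       ("G", 2), ("F", 4), ("E", 6), ("E", 7), ("E", 8)]:
        print(kind, rank, R(kind, rank))
```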

Conjecture. Let k, O, and S be as in Theorem 3, and suppose that G is a simple Chevalley
group scheme. Then

$$\alpha_+(G(O_S)) = \alpha_-(G(O_S)) = \frac{\bigl(\sqrt{R(R+1)} - R\bigr)^2}{4R^2}.$$

The conjecture reflects the belief that "most" subgroups of $H = G(\mathbb{Z}/m\mathbb{Z})$ lie between
the Borel subgroup B of H and the unipotent radical of B. We prove here the lower bound
of the general conjecture (under the same assumptions as in Theorem 3 and 4). In our earlier 
version this was done only for Galois extensions, but it was observed in an earlier version 
of [LN] that a small modification of the argument works in the general case. We thank the 
authors of [LN] for allowing us to make these small modifications here. 

This paper gives a complete proof of the upper bound for the case of $SL_2$, based on the
known detailed classification of subgroups of $SL_2(\mathbb{F}_q)$ for finite fields $\mathbb{F}_q$ of order q. We
also give a partial result towards the upper bound in the general case. The upper bound is 
proved in full for every field k in [LN]. The reader is also referred to a more general version 
there when G is not assumed to be split. 



Theorem 6. Let k, O, and S be as in Theorem 3. Let G be a simple Chevalley group scheme
of dimension d and rank $\ell$, and $R = R(G) = \frac{d-\ell}{2\ell}$, then:

(a) Assuming GRH or the assumptions of Theorem 4,

$$\alpha_-(G(O_S)) \ge \frac{\bigl(\sqrt{R(R+1)} - R\bigr)^2}{4R^2} \sim \frac{1}{16R^2}.$$

(b) There exists an absolute constant C such that

$$\alpha_+(G(O_S)) \le C\,\frac{\bigl(\sqrt{R(R+1)} - R\bigr)^2}{4R^2}.$$



Remark: As the upper bound is proved in full in [LN] (i.e., C = 1 in part (b)) we omit in 
this paper the proof of part (b) of Theorem 6. 
Corollary 7. There exists an absolute constant C such that for d = 2, 3, ...

$$(1 - o(1))\frac{1}{4d^2} \le \alpha_-(SL_d(\mathbb{Z})) \le \alpha_+(SL_d(\mathbb{Z})) \le C\frac{1}{d^2}.$$

This greatly improves the upper bound for $\alpha_+(SL_d(\mathbb{Z}))$, of order $d^2$, implicit in [Lu] and settles a
question asked there.

As a byproduct of the proof of Theorem 5 in §6 we obtain the following. 

Corollary 8. The subgroup growth type of $SL_d(\mathbb{Z}_p)$ is at least $n^c$ where

$$c = (3 - 2\sqrt{2})d^2 - 2(2 - \sqrt{2}),$$

and where $\mathbb{Z}_p$ denotes the ring of p-adic integers.

The counting techniques in this paper can be applied to solve a novel extremal problem in 
multiplicative number theory involving the greatest common divisors of pairs $(p-1, p'-1)$
where p,p' are prime numbers. The solution of this problem does not appear amenable to 
the standard techniques used in analytic number theory. Considering this problem first was 
crucial for obtaining Theorem 5. 

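Theorem 9. For $n \to \infty$, let

$$M(n) = \max\Bigl\{ \prod_{p,p' \in \mathcal{P}} \gcd(p-1, p'-1) \;\Bigm|\; \mathcal{P} = \text{set of distinct primes where } \prod_{p \in \mathcal{P}} p \le n \Bigr\}.$$

Then we have:

$$\lim_{n\to\infty} \frac{\log M(n)}{\lambda(n)} = \frac{1}{4},$$

where $\lambda(n) = (\log n)^2/\log\log n$.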
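For very small n the extremal quantity M(n) can be computed by exhaustive search, which may help in reading the definition. The sketch below makes one assumption that the statement leaves implicit: the product is taken over unordered pairs of distinct primes $p < p'$ in $\mathcal{P}$. The function names are ours, and the search is exponential, so it is only meant for toy values of n and says nothing about the asymptotic constant 1/4.

```python
from math import gcd


def primes_up_to(n):
    """Sieve of Eratosthenes: all primes p <= n."""
    if n < 2:
        return []
    sieve = [True] * (n + 1)
    sieve[0] = sieve[1] = False
    for p in range(2, int(n ** 0.5) + 1):
        if sieve[p]:
            for k in range(p * p, n + 1, p):
                sieve[k] = False
    return [p for p in range(2, n + 1) if sieve[p]]


def M(n):
    """Brute-force M(n), with the product over unordered pairs p < p' (assumption)."""
    primes = primes_up_to(n)
    best = 1

    def search(start, chosen, prod, value):
        nonlocal best
        best = max(best, value)
        for i in range(start, len(primes)):
            p = primes[i]
            if prod * p > n:
                break  # primes are increasing, so no later prime fits either
            gain = 1
            for r in chosen:
                gain *= gcd(p - 1, r - 1)
            search(i + 1, chosen + [p], prod * p, value * gain)

    search(0, [], 1, 1)
    return best


if __name__ == "__main__":
    for n in (14, 15, 65, 200):
        print(n, M(n))
```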

The paper is organized as follows. 

In §1, we present some required preliminaries and notation. 

In §2, we introduce the notion of a Bombieri set which is the crucial ingredient needed 
in the proof of the lower bounds. We then use it in §3 and §4 to prove the lower bounds of 
Theorems 2, 3, 4, and 6. We then turn to the proof of the upper bounds. In §5, we show how 
the counting problem of congruence subgroups in $SL_2(\mathbb{Z})$ can be completely reduced to an
extremal counting problem of subgroups of finite abelian groups; the problem is actually, as 
one may expect, a number theoretic extremal problem - see §6 and §7 where this extremal 
problem is solved and the upper bounds of Theorems 2, 3, and 4 are then deduced in §8. 
Finally, in §9 we prove Theorem 9. 

The results of this paper are announced in [GLNP]. 
§1. Preliminaries and notation

Throughout this paper we let

$$\ell(n) = \frac{\log n}{\log\log n}, \qquad \lambda(n) = \frac{(\log n)^2}{\log\log n}.$$

All logarithms in this paper are to base e. If f and g are functions of n, we will say that f is
small w.r.t. g if $\lim_{n\to\infty} \frac{\log f(n)}{\log g(n)} = 0$. We say that f is small if f is small with respect to $n^{\ell(n)}$.
Note that if f is small, then multiplying $C_n(\Gamma)$ by f will have no effect on the estimates of
$\alpha_+(\Gamma)$ or $\alpha_-(\Gamma)$. We may, and we will, ignore factors which are small.

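Note also that if $\epsilon(n)$ is a function of n which is smaller than n
(i.e., $\log \epsilon(n) = o(\log n)$) then:

(1.1)   $\varlimsup \dfrac{\log C_{n\epsilon(n)}(\Gamma)}{\lambda(n)} = \alpha_+(\Gamma)$

and

(1.2)   $\varliminf \dfrac{\log C_{n\epsilon(n)}(\Gamma)}{\lambda(n)} = \alpha_-(\Gamma).$

The proof of (1.1) follows immediately from the inequalities:

$$\alpha_+(\Gamma) = \varlimsup \frac{\log C_n(\Gamma)}{\lambda(n)} \le \varlimsup \frac{\log C_{n\epsilon(n)}(\Gamma)}{\lambda(n)}
= \varlimsup \frac{\log C_{n\epsilon(n)}(\Gamma)}{\lambda(n\epsilon(n))} \cdot \frac{\lambda(n\epsilon(n))}{\lambda(n)}
\le \alpha_+(\Gamma)\cdot 1 = \alpha_+(\Gamma).$$

Here, we have used the fact that $\lim \frac{\lambda(n\epsilon(n))}{\lambda(n)} = 1$, which is an immediate consequence of the
assumption that $\epsilon(n)$ is small with respect to n. A similar argument proves (1.2).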
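The limit $\lambda(n\epsilon(n))/\lambda(n) \to 1$ can be illustrated numerically. The sketch below works with $L = \log n$ to avoid overflow and takes $\epsilon(n) = \log n$ as a sample choice (this choice, and the function names, are ours); convergence is slow, of order $\log\log n/\log n$.

```python
import math


def lam_from_log(L: float) -> float:
    """lambda(n) = (log n)^2 / log log n, written in terms of L = log n > 1."""
    return L * L / math.log(L)


def ratio(L: float, log_eps: float) -> float:
    """lambda(n * eps(n)) / lambda(n), with L = log n and log_eps = log eps(n)."""
    return lam_from_log(L + log_eps) / lam_from_log(L)


if __name__ == "__main__":
    # Sample choice eps(n) = log n, i.e. log eps(n) = log L.
    for L in (1e2, 1e4, 1e6, 1e8):
        print(L, ratio(L, math.log(L)))
```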

It follows that we can, and we will sometimes indeed, enlarge n a bit when evaluating
$C_n(\Gamma)$, again without influencing $\alpha_+$ or $\alpha_-$. Similar remarks apply if we divide n by $\epsilon(n)$
provided $\epsilon(n)$ is bounded away from 0.

The following lemma is proved in [Lu] in a slightly weaker form and in its current form 
is proved in [LS, Proposition 5.1.1]. 

Lemma 1.1. ("Level versus index"). Let $\Gamma$ be as before. Then there exists a constant c > 0
such that if H is a congruence subgroup of $\Gamma$ of index at most n, then H contains $\Gamma(m)$ for
some $m \le cn$, where $m \in \mathbb{Z}$ and by $\Gamma(m)$ we mean $\Gamma(mO_S)$.
Corollary 1.2. Let $\gamma_n(\Gamma) = \sum_{m=1}^{n} s_n(G(O_S/mO_S))$, where for a group H, $s_n(H)$ denotes
the number of subgroups of H of index at most n. Then we have $\alpha_+(\Gamma) = \varlimsup \frac{\log \gamma_n(\Gamma)}{\lambda(n)}$ and
$\alpha_-(\Gamma) = \varliminf \frac{\log \gamma_n(\Gamma)}{\lambda(n)}$.

Proof. By Lemma 1.1, $C_n(\Gamma) \le \gamma_{cn}(\Gamma)$ for some c > 0. It is also clear that $\gamma_n(\Gamma) \le n \cdot C_n(\Gamma)$.
Since c is small w.r.t. n, Corollary 1.2 follows by arguments of the type we have given
above. □

The number of elements in a finite set X is denoted by #X or |X|. The set of subgroups
of a group G is denoted by Sub(G). 

§2. Bombieri Sets. 

We introduce some additional notation. Let a, q be relatively prime integers with q > 0.
For x > 0, let $\mathcal{P}(x; q, a)$ be the set of primes p with $p \le x$ and $p \equiv a \pmod q$. For a = 1,
we set $\mathcal{P}(x; q) = \mathcal{P}(x; q, 1)$. We also define

$$\vartheta(x; q, a) = \sum_{p \in \mathcal{P}(x; q, a)} \log p.$$

If f(x), g(x) are arbitrary functions of a real variable x, we say $f(x) \sim g(x)$ as $x \to \infty$ if

$$\lim_{x\to\infty} \frac{f(x)}{g(x)} = 1.$$

Define the error term

$$E(x; q, a) = \vartheta(x; q, a) - \frac{x}{\phi(q)}$$

where $\phi(q)$ is Euler's function. Then Bombieri proved the following deep theorem [Bo], [Da].

Theorem 2.1. (Bombieri) Let A > 0 be fixed. Then there exists a constant c(A) > 0 such
that

$$\sum_{q \le \sqrt{x}/(\log x)^A} \; \max_{y \le x} \; \max_{(a,q)=1} |E(y; q, a)| \;\le\; c(A) \cdot \frac{x}{(\log x)^{A-5}}$$

as $x \to \infty$.

This theorem shows that the error terms $\max_{(a,q)=1} E(x; q, a)$ behave as if they satisfy the
Riemann hypothesis in an averaged sense.
Definition 2.2. Let x be a large positive real number. A Bombieri prime (relative to x)
is a prime $q \le \sqrt{x}$ such that the set $\mathcal{P}(x, q)$ of primes $p \le x$ with $p \equiv 1 \pmod q$ satisfies

$$\max_{y \le x} |E(y; q, 1)| \le \frac{x}{\phi(q)(\log x)^2}.$$

We call $\mathcal{P}(x, q)$ a Bombieri set (relative to x).
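The condition in Definition 2.2 can be tested numerically for small x. The following is a minimal sketch, with two simplifying assumptions of ours: q is taken to be prime (so that $\phi(q) = q-1$), and the maximum over $y \le x$ is evaluated at integer y only, which approximates the supremum over real y. All function names are hypothetical.

```python
import math


def primes_up_to(n):
    """Sieve of Eratosthenes: all primes p <= n."""
    if n < 2:
        return []
    sieve = [True] * (n + 1)
    sieve[0] = sieve[1] = False
    for p in range(2, int(n ** 0.5) + 1):
        if sieve[p]:
            for k in range(p * p, n + 1, p):
                sieve[k] = False
    return [p for p in range(2, n + 1) if sieve[p]]


def theta(x, q, a):
    """theta(x; q, a): sum of log p over primes p <= x with p = a (mod q)."""
    return sum(math.log(p) for p in primes_up_to(x) if p % q == a % q)


def max_error(x, q):
    """max over integers 2 <= y <= x of |theta(y; q, 1) - y / phi(q)|, q prime."""
    phi = q - 1
    prime_set = set(primes_up_to(x))
    running, worst = 0.0, 0.0
    for y in range(2, x + 1):
        if y in prime_set and y % q == 1 % q:
            running += math.log(y)
        worst = max(worst, abs(running - y / phi))
    return worst


def is_bombieri_prime(q, x):
    """Check q <= sqrt(x) and max |E(y; q, 1)| <= x / (phi(q) (log x)^2)."""
    bound = x / ((q - 1) * math.log(x) ** 2)
    return q * q <= x and max_error(x, q) <= bound


if __name__ == "__main__":
    x = 10 ** 5
    print([q for q in primes_up_to(50) if is_bombieri_prime(q, x)])
```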

Remark. In all the applications in this paper, we do not really need q to be prime, though it 
makes the calculations somewhat easier. We could work with q being a "Bombieri number".

Lemma 2.3. Fix $0 < \rho < \frac{1}{2}$. Then for x sufficiently large, there exists at least one Bombieri
prime (relative to x) q in the interval

$$\frac{x^\rho}{\log x} \le q \le x^\rho.$$

Proof. Assume that

$$\max_{y \le x} |E(y; q, 1)| > \frac{x}{\phi(q)(\log x)^2}$$

for all primes $\frac{x^\rho}{\log x} \le q \le x^\rho$, i.e., that there are no such Bombieri primes in the interval. In
view of the trivial inequality $\phi(q) = q - 1 < q$, it immediately follows that

$$\sum_{\frac{x^\rho}{\log x} \le q \le x^\rho} \max_{y \le x} |E(y; q, 1)| \;>\; \frac{x}{(\log x)^2} \sum_{\frac{x^\rho}{\log x} \le q \le x^\rho} \frac{1}{q} \;>\; \frac{x \cdot (\log\log x)}{2\rho \cdot (\log x)^3},$$

say, for sufficiently large x. This follows from the well known asymptotic formula [Lan] for
the partial sum of the reciprocal of the primes

$$\sum_{q \le Y} \frac{1}{q} = \log\log Y + b + O\Bigl(\frac{1}{\log Y}\Bigr)$$

as $Y \to \infty$. Here b is an absolute constant. This contradicts Theorem 2.1 with $A \ge 8$
provided x is sufficiently large. □

Lemma 2.4. Let $\mathcal{P}(x, q)$ be a Bombieri set. Then for x sufficiently large

$$\Bigl| \#\mathcal{P}(x, q) - \frac{x}{\phi(q) \log x} \Bigr| \le 3\,\frac{x}{\phi(q)(\log x)^2}.$$



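Proof. We have

$$\sum_{p \in \mathcal{P}(x,q)} 1 = \sum_{n=2}^{x} \frac{\vartheta(n; q, 1) - \vartheta(n-1; q, 1)}{\log n}
= \sum_{n=2}^{x} \vartheta(n; q, 1)\Bigl(\frac{1}{\log n} - \frac{1}{\log(n+1)}\Bigr) + \frac{\vartheta(x; q, 1)}{\log([x]+1)}$$

$$= \sum_{n=2}^{x} \vartheta(n; q, 1)\,\frac{\log(1 + \frac{1}{n})}{\log n \,\log(n+1)} + \frac{\vartheta(x; q, 1)}{\log x} - \vartheta(x; q, 1)\Bigl(\frac{1}{\log x} - \frac{1}{\log([x]+1)}\Bigr).$$

It easily follows that

$$\Bigl| \sum_{p \in \mathcal{P}(x,q)} 1 - \frac{\vartheta(x; q, 1)}{\log x} \Bigr| \le \sum_{n=2}^{x} \frac{\vartheta(n; q, 1)}{n \cdot (\log n)^2} + \vartheta(x; q, 1)\Bigl| \frac{1}{\log x} - \frac{1}{\log(x+1)} \Bigr|.$$

By the property of a Bombieri set, we have the estimate $\bigl|\vartheta(n; q, 1) - \frac{n}{\phi(q)}\bigr| \le \frac{x}{\phi(q)(\log x)^2}$ for
$n \le x$. Since $\bigl|\frac{1}{\log x} - \frac{1}{\log(x+1)}\bigr| \le \frac{1}{x(\log x)^2}$, the second expression on the
right side of the above equation is very small and can be ignored. It remains to estimate
the sum $\sum_{n=2}^{x} \frac{\vartheta(n; q, 1)}{n (\log n)^2}$. This sum can be broken into two parts, the first of which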

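corresponds to $n < \frac{x}{(\log x)^5}$, which is easily seen to be very small, so can be ignored. We
estimate

$$\sum_{\frac{x}{(\log x)^5} \le n \le x} \frac{\vartheta(n; q, 1)}{n \cdot (\log n)^2}
= \sum_{\frac{x}{(\log x)^5} \le n \le x} \frac{n}{\phi(q)\, n (\log n)^2}
+ O\Biggl( \sum_{\frac{x}{(\log x)^5} \le n \le x} \frac{x}{\phi(q)(\log x)^2\, n (\log n)^2} \Biggr)$$

$$\le \sum_{\frac{x}{(\log x)^5} \le n \le x} \frac{1}{\phi(q)(\log n)^2} + O\Bigl( \frac{x}{\phi(q)(\log x)^3} \Bigr)
\le \frac{3}{2}\,\frac{x}{\phi(q)(\log x)^2},$$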



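which holds for x sufficiently large and where the constant $\frac{3}{2}$ is not optimal. Hence

$$\Bigl| \sum_{p \in \mathcal{P}(x,q)} 1 - \frac{\vartheta(x; q, 1)}{\log x} \Bigr| \le \frac{7x}{4\phi(q)(\log x)^2},$$

say. Since $\bigl|\vartheta(x; q, 1) - \frac{x}{\phi(q)}\bigr| \le \frac{x}{\phi(q)(\log x)^2}$, Lemma 2.4 immediately follows. □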



§3. Proof of the lower bound over Q. 

In this section we consider the case of k = Q and O = Z. 

Fix a real number $0 < \rho_0 < \frac{1}{2}$. It follows from Lemma 2.3 that for $x \to \infty$ there exists
a real number $\rho$ which converges to $\rho_0$, and a prime number $q \sim x^\rho$ such that $\mathcal{P}(x, q)$ is a
Bombieri set.

Define

$$P = \prod_{p \in \mathcal{P}(x,q)} p.$$
It is clear from the definition of a Bombieri set that

$$\log P \sim \frac{x}{\phi(q)} \sim x^{1-\rho}$$

and from Lemma 2.4 that

$$L = \#\mathcal{P}(x, q) \sim \frac{x}{\phi(q) \log x} \sim \frac{x^{1-\rho}}{\log x}.$$

Consider $\Gamma(P) = \ker(G(\mathbb{Z}) \to G(\mathbb{Z}/P\mathbb{Z}))$ which is of index at most $P^{\dim(G)}$ in $\Gamma$. Note
that for every subgroup $H/\Gamma(P)$ in $\Gamma/\Gamma(P)$ there corresponds a subgroup H in $\Gamma$ of index
at most $P^{\dim(G)}$ in $\Gamma$.

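By strong approximation

$$\Gamma/\Gamma(P) = G(\mathbb{Z}/P\mathbb{Z}) \cong \prod_{p \in \mathcal{P}(x,q)} G(\mathbb{F}_p).$$

Let B(p) denote the Borel subgroup in $G(\mathbb{F}_p)$. Then

$$\log(\#B(p)) \sim \frac{\dim(G) + \mathrm{rk}(G)}{2} \log p,$$

where rk(G) denotes the rank of G as an algebraic group. But

$$\log(\#G(\mathbb{F}_p)) \sim \dim(G) \log p.$$

It immediately follows that (for $p \to \infty$)

$$\log\,[G(\mathbb{F}_p) : B(p)] \sim \frac{\dim(G) - \mathrm{rk}(G)}{2} \log p,$$

and, therefore,

$$\log\,[G(\mathbb{Z}/P\mathbb{Z}) : B(P)] \sim \frac{\dim(G) - \mathrm{rk}(G)}{2} \log P,$$

where $B(P) \le G(\mathbb{Z}/P\mathbb{Z})$ is:

$$B(P) = \prod_{p \in \mathcal{P}(x,q)} B(p).$$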






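Now B(p) is mapped onto $(\mathbb{F}_p^\times)^{\mathrm{rk}(G)}$ and, hence, is also mapped onto $(\mathbb{Z}/q\mathbb{Z})^{\mathrm{rk}(G)}$ since
$\#\mathbb{F}_p^\times = p - 1$ and $p \equiv 1 \pmod q$. So B(P) is mapped onto

$$(\mathbb{Z}/q\mathbb{Z})^{\mathrm{rk}(G) \cdot L}$$

where

$$L = \#\mathcal{P}(x, q) \sim \frac{x}{\phi(q) \log x} \sim \frac{x^{1-\rho}}{\log x}.$$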

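For a real number $\theta$, define $\lceil\theta\rceil$ to be the smallest integer t such that $\theta \le t$. Let $0 < \sigma < 1$.
We will now use Proposition 6.1, a basic result on counting subspaces of finite vector
spaces. It follows that B(P) has at least

$$q^{\sigma(1-\sigma)\mathrm{rk}(G)^2 L^2 + O(\mathrm{rk}(G) \cdot L)}$$

subgroups whose index in $G(\mathbb{Z}/P\mathbb{Z})$ is equal to

$$q^{\lceil \sigma \cdot \mathrm{rk}(G) \cdot L \rceil} \cdot [G(\mathbb{Z}/P\mathbb{Z}) : B(P)].$$

Hence, for $x \to \infty$,

$$\log(\#\{\text{subgroups}\}) = \bigl(\sigma(1-\sigma)\mathrm{rk}(G)^2 L^2 + O(\mathrm{rk}(G) \cdot L)\bigr) \log q
\sim \sigma(1-\sigma)\,\mathrm{rk}(G)^2 \cdot \frac{x^{2-2\rho}}{(\log x)^2} \cdot \rho \log x,$$

while

$$\log(\text{index}) = \lceil \sigma \cdot \mathrm{rk}(G) \cdot L \rceil \cdot \log q + \tfrac{1}{2}(\dim(G) - \mathrm{rk}(G)) \log P$$
$$\sim \sigma\,\mathrm{rk}(G)\,\frac{x^{1-\rho}}{\log x}\,\rho \log x + \tfrac{1}{2}(\dim(G) - \mathrm{rk}(G))\,x^{1-\rho}
= \bigl(\sigma \cdot \rho \cdot \mathrm{rk}(G) + \tfrac{1}{2}(\dim(G) - \mathrm{rk}(G))\bigr)\,x^{1-\rho},$$

and

$$\log\log(\text{index}) \sim (1 - \rho) \log x.$$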



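It is clear from the estimate for log(index) above that given any index $n \gg 0$ we can choose x
such that $\log(\text{index}) \sim \log n$. We compute

$$\frac{\log(\#\{\text{subgroups}\})}{(\log(\text{index}))^2/\log\log(\text{index})}
\sim \frac{\sigma(1-\sigma) \cdot \mathrm{rk}(G)^2 \cdot \rho\,\frac{x^{2-2\rho}}{\log x}}
{\bigl((\sigma \cdot \rho \cdot \mathrm{rk}(G) + \frac{1}{2}(\dim(G) - \mathrm{rk}(G)))\,x^{1-\rho}\bigr)^2/\bigl((1-\rho)\log x\bigr)}$$

$$= \frac{\sigma(1-\sigma)\rho(1-\rho) \cdot \mathrm{rk}(G)^2}{\bigl((\sigma\rho - \frac{1}{2}) \cdot \mathrm{rk}(G) + \frac{1}{2}\dim(G)\bigr)^2}$$

as $x \to \infty$.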
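We may rewrite

$$\frac{\sigma(1-\sigma)\rho(1-\rho) \cdot \mathrm{rk}(G)^2}{\bigl((\sigma\rho - \frac{1}{2}) \cdot \mathrm{rk}(G) + \frac{1}{2}\dim(G)\bigr)^2} = \frac{\sigma(1-\sigma)\rho(1-\rho)}{(\sigma\rho + R)^2}$$

where

$$R = \frac{\dim(G) - \mathrm{rk}(G)}{2 \cdot \mathrm{rk}(G)}.$$

Now, for fixed R, it is enough to choose $\sigma, \rho$ so that

$$\frac{\sigma(1-\sigma)\rho(1-\rho)}{(\sigma\rho + R)^2}$$

is maximized. This occurs when

$$\rho = \sigma = \sqrt{R(R+1)} - R,$$

in which case we get

$$\frac{\sigma(1-\sigma)\rho(1-\rho)}{(\sigma\rho + R)^2} = \frac{\bigl(\sqrt{R(R+1)} - R\bigr)^2}{4R^2}.$$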
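The location of the maximum can be checked numerically. The sketch below runs a coarse grid search over $(\sigma, \rho) \in (0,1)^2$ and compares the result with the closed form; the grid resolution and the function names are our own choices, and the search is an illustration rather than a proof.

```python
import math


def objective(sigma: float, rho: float, R: float) -> float:
    """sigma(1-sigma) rho(1-rho) / (sigma*rho + R)^2."""
    return sigma * (1 - sigma) * rho * (1 - rho) / (sigma * rho + R) ** 2


def grid_max(R: float, steps: int = 400):
    """Return (best value, sigma, rho) over an open grid in (0,1) x (0,1)."""
    best = (0.0, 0.0, 0.0)
    for i in range(1, steps):
        s = i / steps
        for j in range(1, steps):
            r = j / steps
            v = objective(s, r, R)
            if v > best[0]:
                best = (v, s, r)
    return best


def closed_form(R: float) -> float:
    """(sqrt(R(R+1)) - R)^2 / (4 R^2)."""
    s = math.sqrt(R * (R + 1)) - R
    return s * s / (4 * R * R)


if __name__ == "__main__":
    for R in (1.0, 2.0):
        print(R, grid_max(R), closed_form(R), math.sqrt(R * (R + 1)) - R)
```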

Actually, we choose $\rho_0$ to be $\sqrt{R(R+1)} - R$, then we can take $\rho$ to be asymptotic to
$\rho_0$ as x is going to infinity. Note that $\frac{(\sqrt{R(R+1)} - R)^2}{4R^2} < \frac{1}{16R^2}$ holds for all R > 0. This
follows from the easy inequality $\sqrt{R(R+1)} - R < \frac{1}{2}$. It is also straightforward to see that
$\sqrt{R(R+1)} - R$ converges to $\frac{1}{2}$ as $R \to \infty$, hence $\frac{(\sqrt{R(R+1)} - R)^2}{4R^2} \sim \frac{1}{16R^2}$.

In the special case when R = 1, we obtain the lower bound of Theorem 2. For a simple 
Chevalley group scheme over Q, this gives the lower bound in Theorem 6. 



§4. Proof of the lower bound for a general number field.

To prove the lower bounds over a general number field we need an extension of the 
Bombieri-Vinogradov Theorem to these fields, as was obtained by Murty and Murty [MM].

Let us first fix some notations:

Let k be a finite extension of $\mathbb{Q}$, K its Galois closure of degree d, $\mathfrak{g} = \mathrm{Gal}(K/\mathbb{Q})$,
and $O_k$ the ring of integers in k. For a rational prime q and $x \in \mathbb{R}$, we will
denote by $\mathcal{P}_K(x, q)$ the set of rational primes $p \equiv 1 \pmod q$ where p splits completely in
K and $p \le x$. Let

$$\pi_K(x, q) = \#\mathcal{P}_K(x, q), \qquad \vartheta_K(x, q) = \sum_{p \in \mathcal{P}_K(x,q)} \log p,$$

and,

$$E_K(x, q) = \vartheta_K(x, q) - \frac{x}{d\,\phi(q)}.$$

We shall show that the following theorems follow from Murty and Murty [MM].
Theorem 4.1. Let K be a fixed finite Galois extension of $\mathbb{Q}$. Assume GRH (generalized
Riemann hypothesis) for K and all cyclotomic extensions $K(\zeta_\ell)$ with $\ell$ a rational prime and
$\zeta_\ell$ a primitive $\ell$-th root of unity. Then for every $0 < \rho < \frac{1}{2}$ and $x \to \infty$, there exists a
rational prime q such that

(a) $\frac{x^\rho}{\log x} \le q \le x^\rho$

(b) $\bigl|\pi_K(x, q) - \frac{x}{d'\phi(q)\log x}\bigr| \le 3\,\frac{x}{d'\phi(q)(\log x)^2}$

(c) $\max_{y \le x} |E_K(y, q)| \le \frac{x}{d'\phi(q)(\log x)^2}$,

where $d' = [K : \mathbb{Q}]/t$ and t denotes the degree of the intersection of K and the cyclotomic
field $\mathbb{Q}(\zeta_q)$.

Remark: In fact, GRH gives a stronger result than what is stated in Theorem 4.1. For 
example, it can be shown that for every prime q < x'^ the error terms in parts (b), (c), take 
the form O ix^ log{qx) J (see [MMS] for a more precise bound). Theorem 4.1 is stated in 
this special form because it can be proved unconditionally in some cases. 

Theorem 4.2. Theorem 4.1 can be proved unconditionally for K if either:

(a) $\mathfrak{g} = \mathrm{Gal}(K/\mathbb{Q})$ has an abelian subgroup of index at most 4 (this is true, for example,
if k is an abelian extension);

or

(b) $[K:\mathbb{Q}] < 42$.

Theorem 4.3. Theorem 4.1 is valid unconditionally for every K with the additional
assumption that $0 < \rho < \frac{1}{\eta}$, where $\eta$ is the maximum of 2 and $d^* - 2$, and where $d^*$ is
the index of the largest possible abelian subgroup of $\mathfrak{g} = \mathrm{Gal}(K/\mathbb{Q})$. In particular, we may
take $\eta = d^* - 2$ if $d^* \ge 4$ and $\eta = 2$ if $d^* \le 4$.

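Proof of Theorems 4.1 - 4.3. For any $\epsilon > 0$, $A > 0$, under the assumptions of Theorem 4.1
or 4.2 (a), Murty and Murty [MM] prove the following Bombieri theorem:

(4.1)   $\displaystyle\sum_{q \le x^{\frac{1}{2} - \epsilon}} \max_{(a,q)=1} \max_{y \le x} \Bigl| \pi_C(y, q, a) - \frac{|C|}{|\mathfrak{g}|}\,\frac{1}{\phi(q)}\,\pi(y) \Bigr| \ll \frac{x}{(\log x)^A}.$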
Here C denotes a conjugacy class in $\mathfrak{g}$, $\pi(y) = \sum_{p \le y} 1$,

$$\pi_C(x, q, a) = \sum_{\substack{p \le x \\ (p, K/\mathbb{Q}) = C \\ p \equiv a \ (\mathrm{mod}\ q) \\ p \text{ unramified in } K}} 1,$$

and $(p, K/\mathbb{Q})$ denotes the Artin symbol.

In fact, under the assumption of the GRH, equation (4.1) holds, but without assuming
GRH they showed that (4.1) holds when the sum is over $q \le x^{\frac{1}{\eta} - \epsilon}$ where $\eta$ is defined as
follows: Let

(4.2)   $d^* = \min_{H} \max_{\omega}\,[\mathfrak{g} : H]\,\omega(1).$

The minimum here is over all subgroups H of $\mathrm{Gal}(K/\mathbb{Q})$ satisfying:

(i) $H \cap C \ne \emptyset$, and

(ii) for every irreducible character $\omega$ of H and any non-trivial Dirichlet character $\chi$, the
Artin L-series $L(s, \omega \otimes \chi)$ is entire.

Then the maximum in (4.2) is over the irreducible characters of such H's.

Now

$$\eta = \begin{cases} d^* - 2 & \text{if } d^* \ge 4 \\ 2 & \text{if } d^* \le 4. \end{cases}$$

We need their result for the special case when C is the identity conjugacy class. In this
case $\frac{|C|}{|\mathfrak{g}|} = \frac{1}{d}$ and $\pi_C(y, q, 1) = \pi_K(y, q)$. So for proving Theorem 4.3 we can take for H an
abelian subgroup of smallest index and then H satisfies assumption (i) and (ii). (Recall that
abelian groups satisfy (AC) - Artin conjecture, i.e. $L(s, \omega \otimes \chi)$ is entire - see [CF]).

For Theorem 4.2(a), again take H to be the abelian subgroup of index at most 4. It
satisfies (i) and (ii) and this time $\eta = 2$.

For Theorem 4.2(b): Going case by case over all possible numbers d < 42, one can deduce
by elementary group theoretic arguments that every finite group $\mathfrak{g}$ of order d < 42 has an
abelian subgroup of index at most 4, unless d = 24 and $\mathfrak{g}$ is isomorphic to the symmetric
group $S_4$. But for this group, Artin [CF] proved Artin's conjecture in 1925. Moreover, every
irreducible character of $S_4$ is of degree at most 4. Thus for $\mathfrak{g} = S_4$ we have $d^* = 4$ and so
$\eta = 2$.

The proofs of Theorems 4.1, 4.2 and 4.3 follow now in the same manner as in §2. 

Using Theorems 4.1, 4.2, 4.3, we can now prove the lower bounds of Theorem 3 and 4 
just as in §3. Note that for every prime $p \in \mathcal{P}_K(x, q)$ we may take an ideal $\pi = \pi(p)$ in $O_k$
with $[O_k : \pi] = p$, $\pi \cap \mathbb{Z} = p\mathbb{Z}$. Let

$$P = \prod_{p \in \mathcal{P}_K(x,q)} \pi(p).$$

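Then, since $x \to \infty$, we may choose $q, \rho$ (using Theorem 4.1) so that

$$\log\,[O : P] \sim \frac{x}{d\,\phi(q)} \sim \frac{x^{1-\rho}}{d}, \qquad L = \#\mathcal{P}_K(x, q) \sim \frac{x}{d\,\phi(q) \log x} \sim \frac{x^{1-\rho}}{d \log x},$$

and

$$G(O/P) = \prod_{p \in \mathcal{P}_K(x,q)} G(O/\pi(p)) \cong \prod_{p \in \mathcal{P}_K(x,q)} G(\mathbb{Z}/p\mathbb{Z}).$$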

We can now take for every rational prime $p \in \mathcal{P}_K(x, q)$, the Borel subgroup B(p) as in §3
and define:

$$B(P) = \prod_{p \in \mathcal{P}_K(x,q)} B(p).$$

Then B(P) is mapped onto $(\mathbb{Z}/q\mathbb{Z})^{\mathrm{rk}(G) \cdot L}$ and

$$\log\,[G(O/P) : B(P)] \sim \frac{\dim(G) - \mathrm{rk}(G)}{2} \cdot \log\,[O : P].$$

Thus, by a computation similar to the one in §3 (note that the d's cancel in this computation),
we can show that

$$\alpha_-(G(O)) \ge \frac{\bigl(\sqrt{R(R+1)} - R\bigr)^2}{4R^2}.$$

The lower bounds of Theorems 3, 4, and 6 are now also proved. We now turn to the proof 
of the upper bounds. 

§5. From $SL_2$ to abelian groups

In this section we show how to reduce the estimation of $\alpha_+(SL_2(\mathbb{Z}))$ to a problem on
abelian groups.

Corollary 1.2 shows us that in order to give an upper bound on $\alpha_+(\Gamma)$ it suffices to bound
$s_n(G(\mathbb{Z}/m\mathbb{Z}))$ when $m \le n$. Our first goal is to show that we can further assume that m
is a product of different primes. To this end denote $\bar{m} = \prod p$ where p runs through all the
primes dividing m.

We have an exact sequence

$$1 \to K \to G(\mathbb{Z}/m\mathbb{Z}) \xrightarrow{\ \pi\ } G(\mathbb{Z}/\bar{m}\mathbb{Z}) \to 1$$

where K is a nilpotent group of rank at most dim G. Here, the rank of a finite group G is
defined to be the smallest integer r such that every subgroup of G is generated by r elements
(see [LS, Window 5, §2]).
Lemma 5.1. Let $1 \to K \to U \xrightarrow{\ \pi\ } L \to 1$ be an exact sequence of finite groups, where K is
a solvable group of derived length $\ell$ and of rank at most r. Then the number of supplements
to K in U (i.e., of subgroups H of U for which tt{H) = L) is bounded by \U\^'^ '^^^ . 

Proof. See [LS, Corollary 1.3.5]. 

Corollary 5.2. $s_n(G(\mathbb{Z}/m\mathbb{Z})) \le m^{f'(\dim G)\log\log m} \cdot s_n(G(\mathbb{Z}/\bar{m}\mathbb{Z}))$ where $f'(\dim G)$ depends
only on dim G.

Proof. Let H be a subgroup of index at most n in $G(\mathbb{Z}/m\mathbb{Z})$ and denote $L = \pi(H) \le
G(\mathbb{Z}/\bar{m}\mathbb{Z})$. So L is of index at most n in $G(\mathbb{Z}/\bar{m}\mathbb{Z})$. Let $U = \pi^{-1}(L)$, so every subgroup H of
$G(\mathbb{Z}/m\mathbb{Z})$ with $\pi(H) = L$ is a subgroup of U. Given L (and hence also U) we have the exact
sequence $1 \to K \to U \to L \to 1$ and by Lemma 5.1, the number of H in U with $\pi(H) = L$
is at most $|U|^{\ell f(r)}$ where $\ell$ is the derived length of K, $r \le \dim G$ is the rank of K and
$f(r) \le f(\dim G)$ where f is some function depending on r and independent of m (say $f(r) =
3r^2 + r$). Now $|U| \le m^{\dim G}$ and K, being nilpotent, is of derived length $O(\log\log|K|)$. We
can, therefore, deduce that $s_n(G(\mathbb{Z}/m\mathbb{Z})) \le m^{c\,\dim G \cdot f(\dim G)(\log\log m + \log\dim G)}\, s_n(G(\mathbb{Z}/\bar{m}\mathbb{Z}))$
for some constant c which proves our claim.

Corollary 1.2 shows us that in order to estimate $\alpha_+(G(\mathbb{Z}))$ one should concentrate on
$s_n(G(\mathbb{Z}/m\mathbb{Z}))$ with $m \le n$. Corollary 5.2 implies that we can further assume that m is a
product of different primes. So let us now assume that $m = \prod_{i=1}^{t} q_i$ where the $q_i$ are different
primes and so $G(\mathbb{Z}/m\mathbb{Z}) \cong \prod_{i=1}^{t} G(\mathbb{Z}/q_i\mathbb{Z})$ and $t \le (1 + o(1))\frac{\log m}{\log\log m}$. We can further assume
that we are counting only fully proper subgroups of $G(\mathbb{Z}/m\mathbb{Z})$, i.e., subgroups H which do
not contain $G(\mathbb{Z}/q_i\mathbb{Z})$ for any $1 \le i \le t$, or equivalently the image of H under the projection
to $G(\mathbb{Z}/q_i\mathbb{Z})$ is a proper subgroup (see [Lu]). Thus H is contained in $\prod_{i=1}^{t} M_i$ where $M_i$ is a
maximal subgroup of $G(\mathbb{Z}/q_i\mathbb{Z})$.

Let us now specialize to the case $G = SL_2$, and let q be a prime.

Maximal subgroups of $SL_2(\mathbb{Z}/q\mathbb{Z})$ are conjugate to one of the following three types of
subgroups (see [La, Theorem 2.2, 2.3, pp. 183-185]).

(1) $B = B_q$ - the Borel subgroup of all upper triangular matrices in $SL_2$.

(2) $D = D_q$ - a dihedral subgroup of order $2(q-1)$ or $2(q+1)$ which is equal to $N(T_q)$,
the normalizer of a split or non-split torus $T_q$. The group $T_q$ is either the diagonal subgroup
or is obtained as follows: Let $\mathbb{F}_{q^2}$ be the field of order $q^2$; $\mathbb{F}_{q^2}^\times$ acts on $\mathbb{F}_{q^2}$ by multiplication.
The latter is a 2-dimensional vector space over $\mathbb{F}_q$. The elements of norm 1 in $\mathbb{F}_{q^2}^\times$ induce
the subgroup $T_q$ of $SL_2(\mathbb{F}_q)$.

(3) $A = A_q$ - a subgroup of $SL_2(\mathbb{Z}/q\mathbb{Z})$ which is of order at most 120.

There are only boundedly many conjugacy classes of each type. Also, the number of
conjugates of every subgroup is small, so it suffices to count only subgroups of $SL_2(\mathbb{Z}/m\mathbb{Z})$
whose projection to $SL_2(\mathbb{Z}/q\mathbb{Z})$ (for $q \mid m$) is inside either B, D, or A.

Let $S \subseteq \{q_1, \ldots, q_t\}$ be the subset of the prime divisors of m for which the projection
of H is in $A_{q_i}$ and $\bar{S}$ the complement to S. Let $\bar{m} = \prod_{q \in \bar{S}} q$ and $\bar{H}$ the projection of H to
$SL_2(\mathbb{Z}/\bar{m}\mathbb{Z})$. So $\bar{H}$ is a subgroup of index at most n in $SL_2(\mathbb{Z}/\bar{m}\mathbb{Z})$ and the kernel N from
$H \to \bar{H}$ is inside a product of |S| groups of type A. As every subgroup of $SL_2(\mathbb{Z}/q\mathbb{Z})$ is
generated by two elements, H is generated by at most $2\frac{\log m}{\log\log m} \le 2\frac{\log n}{\log\log n}$ generators. Set
$k = \bigl[2\frac{\log n}{\log\log n} + 1\bigr]$ and choose k generators for $\bar{H}$. By a lemma of Gaschütz (cf. [FJ, Lemma
15.30]) these k generators can be lifted up to give k generators for H. Each generator
can be lifted up in at most |N| ways and N is a group of order at most $120^{|S|} \le 120^{t} \le
120^{\frac{\log n}{\log\log n}}$. We, therefore, conclude that given $\bar{H}$ the number of possibilities for H is at most
$120^{2(\log n)^2/(\log\log n)^2}$ which is small w.r.t. $n^{\ell(n)}$.

We can, therefore, assume that $S = \emptyset$ and all the projections of H are either into groups
of type B or D.

Now, $B_q$, the Borel subgroup of $SL_2(\mathbb{Z}/q\mathbb{Z})$, has a normal unipotent cyclic subgroup $U_q$
of order q. Let now S be the subset of $\{q_1, \ldots, q_t\}$ for which the projection is in B and
$\bar{S}$ the complement. Then

$$H \le \prod_{q \in S} B_q \times \prod_{q \in \bar{S}} D_q.$$

Let $\bar{H}$ be the projection of H to $\prod_{q \in S} B_q/U_q \times \prod_{q \in \bar{S}} D_q$. The kernel is a subgroup of the cyclic
group $U = \prod_{q \in S} U_q$. By Lemma 5.1 we know that given $\bar{H}$, there are only few possibilities
for H. We are, therefore, led to counting subgroups in

$$L = \prod_{q \in S} B_q/U_q \times \prod_{q \in \bar{S}} D_q.$$

Let E now be the product

$$E = \prod_{q \in S} B_q/U_q \times \prod_{q \in \bar{S}} T_q,$$

and for a subgroup H of L consider the intersection $H \cap E$.

Our next goal will be to show that given $H \cap E$ in E, the number of possibilities for H is
small. To this end we formulate first two easy lemmas, which will be used in the proof of
Proposition 5.6 below. This proposition will complete the main reduction.
Lemma 5.3. Let H be a subgroup of $U = U_1 \times U_2$. For i = 1, 2 denote $H_i = \pi_i(H)$ where
$\pi_i$ is the projection from U to $U_i$, and $H_i^0 = H \cap U_i$. Then:

(i) $H_i^0$ is normal in $H_i$ and $H_1/H_1^0 \cong H_2/H_2^0$ with an isomorphism $\varphi$ induced by the
inclusion of $H/(H_1^0 \times H_2^0)$ as a subdirect product of $H_1/H_1^0$ and $H_2/H_2^0$,

(ii) H is determined by:

(a) $H_i$ for i = 1, 2

(b) $H_i^0$ for i = 1, 2

(c) the isomorphism $\varphi$ from $H_1/H_1^0$ to $H_2/H_2^0$.

Proof. See [Su, p 141]. □
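For a product of two cyclic groups the description in Lemma 5.3 turns into an explicit count, which can be compared with brute-force enumeration. In the sketch below (our own illustration, with hypothetical function names) a subgroup of $C_m \times C_n$ is recorded by the orders a, b of its two projections, the common order e of the quotients $H_1/H_1^0 \cong H_2/H_2^0$, and one of the $\phi(e)$ isomorphisms between two cyclic groups of order e.

```python
from itertools import product
from math import gcd


def divisors(n):
    return [d for d in range(1, n + 1) if n % d == 0]


def euler_phi(n):
    return sum(1 for k in range(1, n + 1) if gcd(k, n) == 1)


def goursat_count(m, n):
    """Count subgroups of C_m x C_n via the data (H_1, H_1^0, H_2, H_2^0, phi)."""
    total = 0
    for a in divisors(m):                  # order of the projection H_1
        for b in divisors(n):              # order of the projection H_2
            for e in divisors(gcd(a, b)):  # order of the common cyclic quotient
                total += euler_phi(e)      # isomorphisms between the quotients
    return total


def generated(gens, m, n):
    """Subgroup of Z/m x Z/n generated by gens (closure under adding generators)."""
    elems = {(0, 0)}
    frontier = [(0, 0)]
    while frontier:
        x, y = frontier.pop()
        for g, h in gens:
            z = ((x + g) % m, (y + h) % n)
            if z not in elems:
                elems.add(z)
                frontier.append(z)
    return frozenset(elems)


def brute_force_count(m, n):
    """Every subgroup of Z/m x Z/n needs at most two generators."""
    elements = list(product(range(m), range(n)))
    return len({generated((u, v), m, n) for u in elements for v in elements})


if __name__ == "__main__":
    for m, n in [(2, 2), (4, 6), (3, 9)]:
        print(m, n, goursat_count(m, n), brute_force_count(m, n))
```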

Definition 5.4. Let U be a group and V a subnormal subgroup of U. We say that V is
co-poly-cyclic in U of co-length $\ell$ if there is a sequence $V = V_0 \lhd V_1 \lhd \ldots \lhd V_\ell = U$ such
that $V_i/V_{i-1}$ is cyclic for every $i = 1, \ldots, \ell$.

Lemma 5.5. Let U be a group and F a subgroup of U. The number of subnormal co-poly-cyclic
subgroups V of U containing F and of co-length $\ell$ is at most $|U : F|^\ell$.

Proof. For $\ell = 1$, V contains [U, U]F and so it suffices to prove the lemma for the abelian
group $\bar{U} = U/[U, U]F$ and $\bar{F} = \{e\}$. For an abelian group $\bar{U}$, the number of subgroups V
with $\bar{U}/V$ cyclic is equal, by Pontrjagin duality, to the number of cyclic subgroups. This is
clearly bounded by $|\bar{U}| \le |U : F|$. If $\ell > 1$, then by induction the number of possibilities
for $V_1$ as in Definition 5.4 is bounded by $|U : F|^{\ell-1}$. Given $V_1$, the number of possibilities
for V is at most $|V_1 : F| \le |U : F|$ by the case $\ell = 1$. Thus, V has at most $|U : F|^\ell$
possibilities. □

Proposition 5.6. Let D = Di x . . . x Dg where each Di is a finite dihedral group with a 
cyclic subgroup Ti of index 2. Let T = Ti x . . . x Tg, so, \D : T\ = 2^ . The number of 
subgroups H of D whose intersection with T is a given subgroup LofTis at most |L'|^2^* . 

Proof. Denote $F_i = \prod_{j \ge i} D_j$. We want to count the number of subgroups $H$ of $D$ with $H \cap T = L$. Let $L_i = \mathrm{proj}_{F_i}(L)$, i.e., the projection of $L$ to $F_i$, and $\tilde{L}_{i+1} = L_i \cap F_{i+1}$, so $\tilde{L}_{i+1} \subseteq L_{i+1}$. Let $H_i$ be the projection of $H$ to $F_i$. Given $H$, the sequence $(H_1 = H, H_2, \dots, H_s)$ is determined and, of course, vice versa. We will actually prove that the number of possibilities for $(H_1, \dots, H_s)$ is at most $|D|^8 2^{2s^2}$.

Assume now that $H_{i+1}$ is given. What is the number of possibilities for $H_i$? Well, $H_i$ is a subgroup of $F_i = D_i \times F_{i+1}$ containing $L_i$, whose projection to $F_{i+1}$ is $H_{i+1}$ and whose intersection with $F_{i+1}$, which we will denote by $X$, contains $\tilde{L}_{i+1}$. By Lemma 5.3, $H_i$ is determined by $H_{i+1}$, $X$, $Y$, $Z$ and $\varphi$, where $Y$ is the projection of $H_i$ to $D_i$, $Z = H_i \cap D_i$ and $\varphi$ is an isomorphism from $Y/Z$ to $H_{i+1}/X$. Now, every subgroup of the dihedral group is generated by two elements and so the number of possibilities for $Y$ and $Z$ is at most $|D_i|^2$ each, and the number of automorphisms of $Y/Z$ is also at most $|D_i|^2$.

Let us now look at $X$: $X$ is a normal subgroup of $H_{i+1}$ with $H_{i+1}/X$ isomorphic to $Y/Z$, so it is meta-cyclic. Moreover, $X$ contains $\tilde{L}_{i+1}$. So by Lemma 5.5, the number of possibilities for $X$ is at most $|H_{i+1} : \tilde{L}_{i+1}|^2$.

Now $|H_{i+1} : \tilde{L}_{i+1}| \le |H_{i+1} : L_{i+1}|\,|L_{i+1} : \tilde{L}_{i+1}|$. We know that $|H_{i+1} : L_{i+1}| = |\mathrm{proj}_{F_{i+1}}(H) : \mathrm{proj}_{F_{i+1}}(L)| \le |H : L| \le 2^s$ and $|L_{i+1} : \tilde{L}_{i+1}| = |\mathrm{proj}_{F_{i+1}}(L_i) : F_{i+1} \cap L_i| \le |D_i|$. So $|H_{i+1} : \tilde{L}_{i+1}| \le 2^s \cdot |D_i|$.

Altogether, given $H_{i+1}$ (and $L$, and hence also the $L_i$'s and $\tilde{L}_i$'s), the number of possibilities for $H_i$ is at most $|D_i|^2 \cdot |D_i|^2 \cdot |D_i|^2 \cdot (2^s |D_i|)^2 = |D_i|^8 2^{2s}$. Arguing now by induction we deduce that the number of possibilities for $(H_1, \dots, H_s)$ is at most $|D|^8 2^{2s^2}$ as claimed. □

Let's now get back to $SL_2$: Proposition 5.6 implies, in the notations before Lemma 5.3, that when counting subgroups of

$$L = \prod B_q/U_q \times \prod D_q$$

we can count instead the subgroups of

$$E = \prod B_q/U_q \times \prod T_q$$

(the products being taken over the index sets fixed before Lemma 5.3), where $T_q$ is a torus in $SL_2(\mathbb{Z}/q\mathbb{Z})$ (so $T_q$ is a cyclic group of order $q-1$ or $q+1$, while $B_q/U_q$ is a cyclic group of order $q-1$).

A remark is needed here: Let $H$ be a subgroup of index at most $n$ in $SL_2(\mathbb{Z}/m\mathbb{Z})$ which is contained in $X = \prod B_q \times \prod D_q$ and contains $Y = \prod U_q \times \prod \{e\}$. By our analysis in this section, these are the groups which we have to count in order to determine $\alpha_+(SL_2(\mathbb{Z}))$.

We proved that for counting them it suffices to count subgroups of $X_0/Y$, where $X_0 = \prod B_q \times \prod T_q$. Note, though, that replacing $H$ with its intersection with $X_0$ may enlarge the index of $H$ in $SL_2(\mathbb{Z}/m\mathbb{Z})$. But the factor is at most

$$2^{\log m/\log\log m} \le m^{1/\log\log m} \le n^{1/\log\log n}.$$

As $n \to \infty$, this factor is small with respect to $n$. By the remark made in §1, we can deduce that our original problem is now completely reduced to the following extremal problem on counting subgroups of finite abelian groups:

Let $\mathcal{P}_- = \{q_1, \dots, q_t\}$ and $\mathcal{P}_+ = \{q'_1, \dots, q'_{t'}\}$ be two sets of (different) primes and let $\mathcal{P} = \mathcal{P}_- \cup \mathcal{P}_+$. Denote

$$f(n) = \sup\Big\{ s_r(X) \;\Big|\; X = \prod_{i=1}^{t} C_{q_i - 1} \times \prod_{j=1}^{t'} C_{q'_j + 1} \Big\}$$

where the supremum is over all possible choices of $\mathcal{P}_-$, $\mathcal{P}_+$ and $r$ such that

t t' 

and where Cm denotes the cyclic group of order m. The discussion above implies: 
Proposition 5.7. We have

$$\alpha_+(SL_2(\mathbb{Z})) = \limsup_{n\to\infty} \frac{\log f(n)}{\lambda(n)}.$$



§6. Counting subgroups of p-groups 

In this section we first give some general estimates for the number of subgroups of finite 
abelian p-groups which will be needed in §7. As an application we obtain a lower bound for 
the subgroup growth of uniform pro-p-groups (see definitions later) . 

For an abelian $p$-group $G$, we denote by $\Omega_i(G)$ the subgroup of elements of order dividing $p^i$. Then $\Omega_i(G)/\Omega_{i-1}(G)$ is an elementary abelian group of order, say, $p^{\lambda_i}$, called the $i$-th layer of $G$. We call the sequence $\lambda_1 \ge \lambda_2 \ge \dots \ge \lambda_r$ the layer type of $G$. It is clear that this sequence is decreasing.



Denote by $\left[{\lambda \atop \nu}\right]_p$ the $p$-binomial coefficient, that is, the number of $\nu$-dimensional subspaces of a $\lambda$-dimensional vector space over $\mathbb{Z}/p\mathbb{Z}$.

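The following holds (see [LS, Proposition 1.5.2]).

Proposition 6.1.

(i) $p^{\nu(\lambda-\nu)} \le \left[{\lambda \atop \nu}\right]_p \le p^{\nu} \cdot p^{\nu(\lambda-\nu)}$.

(ii) $\max_{\nu} \left[{\lambda \atop \nu}\right]_p$ is attained for $\nu = [\lambda/2]$, in which case $\left[{\lambda \atop \nu}\right]_p = p^{\frac{1}{4}\lambda^2 + o(\lambda)}$ holds as $\lambda \to \infty$.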
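The two-sided bound in part (i) is easy to test numerically. The sketch below is an illustration only: it uses the standard product formula for the $p$-binomial coefficient, which is not stated in the text, and the function names are our own. It checks the inequalities over a small range of parameters.

```python
def p_binomial(lam, nu, p):
    """Number of nu-dimensional subspaces of a lam-dimensional space over Z/pZ."""
    if nu < 0 or nu > lam:
        return 0
    num, den = 1, 1
    for i in range(nu):
        num *= p ** (lam - i) - 1
        den *= p ** (i + 1) - 1
    return num // den  # the quotient is always an integer

def check_bounds(max_lam=12, primes=(2, 3, 5)):
    """Verify p^{nu(lam-nu)} <= [lam, nu]_p <= p^nu * p^{nu(lam-nu)} on a small range."""
    for p in primes:
        for lam in range(max_lam + 1):
            for nu in range(lam + 1):
                b = p_binomial(lam, nu, p)
                lower = p ** (nu * (lam - nu))
                if not (lower <= b <= p ** nu * lower):
                    return False
    return True

print(p_binomial(4, 2, 2), check_bounds())
```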



The starting point is the following well-known formula (see [Bu]).

Proposition 6.2. Let $G$ be an abelian $p$-group of layer type $\lambda_1 \ge \lambda_2 \ge \dots \ge \lambda_r$. The number of subgroups of layer type $\nu_1 \ge \nu_2 \ge \dots$ is

$$\prod_{i \ge 1} p^{\nu_{i+1}(\lambda_i - \nu_i)} \left[{\lambda_i - \nu_{i+1} \atop \nu_i - \nu_{i+1}}\right]_p. \qquad \square$$

(In the above expression we allow some of the $\nu_i$ to be 0.)
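As a sanity check, the formula of Proposition 6.2 can be summed over all admissible layer types $\nu$ to recover the total number of subgroups of a small abelian $p$-group. The Python sketch below is an illustration under our own naming (`subgroups_of_layer_type`, `count_subgroups`); a layer type is passed as a tuple, so that for instance $C_4 \times C_2$ has layer type `(2, 1)` with $p = 2$.

```python
from itertools import product

def p_binomial(lam, nu, p):
    """Number of nu-dimensional subspaces of a lam-dimensional space over Z/pZ."""
    if nu < 0 or nu > lam:
        return 0
    num, den = 1, 1
    for i in range(nu):
        num *= p ** (lam - i) - 1
        den *= p ** (i + 1) - 1
    return num // den

def subgroups_of_layer_type(p, lam, nu):
    """Number of subgroups of layer type nu in a p-group of layer type lam."""
    r = len(lam)
    nu = tuple(nu) + (0,) * (r + 1 - len(nu))  # pad so that nu[i + 1] exists
    total = 1
    for i in range(r):
        total *= p ** (nu[i + 1] * (lam[i] - nu[i]))
        total *= p_binomial(lam[i] - nu[i + 1], nu[i] - nu[i + 1], p)
    return total

def count_subgroups(p, lam):
    """Sum the formula over all non-increasing nu with 0 <= nu_i <= lam_i."""
    count = 0
    for nu in product(*(range(l + 1) for l in lam)):
        if all(nu[i] >= nu[i + 1] for i in range(len(nu) - 1)):
            count += subgroups_of_layer_type(p, lam, nu)
    return count

print(count_subgroups(2, (2, 1)))  # C_4 x C_2
print(count_subgroups(2, (2, 2)))  # C_4 x C_4
```

For the two examples the sums agree with the subgroup counts obtained by direct enumeration of these groups.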
We need the following estimate. 



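Proposition 6.3.

$$\prod_{i \ge 1} p^{\nu_i(\lambda_i - \nu_i)} \;\le\; \prod_{i \ge 1} p^{\nu_{i+1}(\lambda_i - \nu_i)} \left[{\lambda_i - \nu_{i+1} \atop \nu_i - \nu_{i+1}}\right]_p \;\le\; p^{\nu_1} \prod_{i \ge 1} p^{\nu_i(\lambda_i - \nu_i)}.$$

Proof. By Proposition 6.1 we have

$$\prod_{i \ge 1} p^{\nu_{i+1}(\lambda_i - \nu_i)} \left[{\lambda_i - \nu_{i+1} \atop \nu_i - \nu_{i+1}}\right]_p \le \prod_{i \ge 1} p^{\nu_{i+1}(\lambda_i - \nu_i)} \cdot p^{(\nu_i - \nu_{i+1})((\lambda_i - \nu_{i+1}) - (\nu_i - \nu_{i+1}))} \cdot p^{(\nu_i - \nu_{i+1})}$$

$$= p^{\nu_1} \prod_{i \ge 1} p^{\nu_{i+1}(\lambda_i - \nu_i)} \cdot p^{(\nu_i - \nu_{i+1})(\lambda_i - \nu_i)} = p^{\nu_1} \prod_{i \ge 1} p^{\nu_i(\lambda_i - \nu_i)}.$$

The lower bound follows in a similar way. □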

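Corollary 6.4. Let $G$ be an abelian group of order $p^{\alpha}$ and layer type $\lambda_1 \ge \lambda_2 \ge \dots \ge \lambda_r$. Then

$$|G|^{-1} \prod_{i \ge 1} p^{\lambda_i^2/4} \le |\mathrm{Sub}(G)| \le |G|^{2} \prod_{i \ge 1} p^{\lambda_i^2/4}$$

holds.

Proof. Considering subgroups $H$ of layer type $[\lambda_1/2] \ge [\lambda_2/2] \ge \dots$ we obtain that $|\mathrm{Sub}(G)| \ge \prod_{i \ge 1} p^{[\lambda_i/2](\lambda_i - [\lambda_i/2])} \ge p^{-r} \prod_{i \ge 1} p^{\lambda_i^2/4}$, which implies the lower bound.

On the other hand, for any fixed layer type $\nu_1 \ge \nu_2 \ge \dots$ the number of subgroups $H$ with this layer type is, by Proposition 6.3, at most $p^{\nu_1} \prod_{i \ge 1} p^{\nu_i(\lambda_i - \nu_i)} \le |G| \prod_{i \ge 1} p^{\lambda_i^2/4}$.

The number of possible layer types $\nu_1 \ge \nu_2 \ge \dots$ of subgroups of $G$ is bounded by the number of partitions of the number $\alpha$, hence it is at most $2^{\alpha} \le |G|$. This implies our statement. □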

Let us make an amusing remark which will not be needed later. 

If $G$ is an abelian $p$-group of the form $G = C_{x_1} \times C_{x_2} \times \dots \times C_{x_t}$ then it is known (see [LS, §1.10]) that $|\mathrm{End}(G)| = \prod_{j,k \ge 1} \gcd(x_j, x_k)$. Noting that $\prod_{j,k \ge 1} \gcd(x_j, x_k) = \prod_{i \ge 1} p^{\lambda_i^2}$ we obtain that

$$|G|^{-1} |\mathrm{End}(G)|^{1/4} \le |\mathrm{Sub}(G)| \le |G|^{2} |\mathrm{End}(G)|^{1/4}.$$

These inequalities clearly extend to arbitrary finite abelian groups $G$.
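The remark can be illustrated on very small groups. The following Python sketch is our own illustration, not part of the paper: it counts subgroups of $G = \mathbb{Z}_{m_1} \times \dots \times \mathbb{Z}_{m_k}$ by brute force, computes $|\mathrm{End}(G)|$ as the product of the $\gcd(x_j, x_k)$, and compares fourth powers so that everything stays in exact integer arithmetic. The helper `layer_type` (a hypothetical name) computes $\lambda_i$ as the number of cyclic factors $C_{p^{e}}$ with $e \ge i$.

```python
from itertools import product
from math import gcd, prod

def count_subgroups(mods):
    """Brute-force |Sub(G)| for G = Z_{m1} x ... x Z_{mk} (small groups only)."""
    elements = list(product(*(range(m) for m in mods)))
    zero = tuple(0 for _ in mods)
    subgroups = set()
    for gens in product(elements, repeat=len(mods)):
        seen, frontier = {zero}, [zero]
        while frontier:
            x = frontier.pop()
            for g in gens:
                y = tuple((a + b) % m for a, b, m in zip(x, g, mods))
                if y not in seen:
                    seen.add(y)
                    frontier.append(y)
        subgroups.add(frozenset(seen))
    return len(subgroups)

def end_order(mods):
    """|End(G)| as the product of gcd(x_j, x_k) over all ordered pairs (j, k)."""
    return prod(gcd(a, b) for a in mods for b in mods)

def layer_type(exponents):
    """Layer type of C_{p^e1} x ... x C_{p^ek}: lambda_i = #{j : e_j >= i}."""
    return [sum(1 for e in exponents if e >= i) for i in range(1, max(exponents) + 1)]

def inequalities_hold(mods):
    """Check |G|^-1 |End|^(1/4) <= |Sub| <= |G|^2 |End|^(1/4) via fourth powers."""
    size, sub, end = prod(mods), count_subgroups(mods), end_order(mods)
    return end <= (size * sub) ** 4 and sub ** 4 <= size ** 8 * end

# C_4 x C_2: exponents (2, 1) with p = 2
lam = layer_type((2, 1))
print(end_order((4, 2)), 2 ** sum(l * l for l in lam), inequalities_hold((4, 2)))
```

For groups this small the inequalities hold with a lot of room to spare; the check is meant only to make the statement concrete.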

For the application of the above results to estimating the subgroup growth of $SL_d(\mathbb{Z}_p)$ we have to introduce additional notation. For a group $G$ let $G^k$ denote the subgroup generated by all $k$-th powers. For odd $p$ a powerful $p$-group $G$ is a $p$-group with the property that $G/G^p$ is abelian. (In the rest of this section we will always assume that $p$ is odd; the case $p = 2$ requires only slight modifications.) $G$ is said to be uniformly powerful (uniform, for short) if it is powerful and the indices $|G^{p^i} : G^{p^{i+1}}|$ do not depend on $i$ as long as $i < e$, where $p^e$ is the exponent of $G$.

Now let $G$ be a uniform group of exponent $p^e$, where $e = 2i$, with $d$ generators. Then $G^{p^i}$ is a homocyclic abelian group of exponent $p^i$ and $d$ generators (i.e. it has layer type $d, d, \dots, d$ with $i$ terms) [Sh].

Consider subgroups $H$ of $G^{p^i}$ of layer type $\nu, \nu, \dots, \nu$ ($i$ terms). The number of such subgroups is at least $p^{i\nu(d-\nu)}$ by Proposition 6.3. The index $n$ of such a subgroup $H$ in $G$ is $p^{di + i(d-\nu)}$. Hence the number of index $n$ subgroups in $G$ is at least $n^x$ where $x = \frac{\nu(d-\nu)}{2d-\nu}$. Substituting $\nu = [d(2 - \sqrt{2})]$ we see that $x$ can be as large as $(3 - 2\sqrt{2})d - (\sqrt{2} - 1)$.
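The last substitution can be checked numerically. With the count $p^{i\nu(d-\nu)}$ and the index $p^{i(2d-\nu)}$, the exponent is $x(\nu) = \nu(d-\nu)/(2d-\nu)$; over the reals it is maximised at $\nu = d(2-\sqrt{2})$, where it equals $(3-2\sqrt{2})d$, and rounding $\nu$ down costs at most $\sqrt{2}-1$. The Python sketch below (function names are our own) verifies this loss bound for a range of $d$.

```python
import math

def growth_exponent(d, nu):
    """x = nu (d - nu) / (2d - nu): log of the count divided by log of the index."""
    return nu * (d - nu) / (2 * d - nu)

def best_integer_nu(d):
    """Integer part of the real maximiser d (2 - sqrt 2)."""
    return math.floor(d * (2 - math.sqrt(2)))

def claimed_lower_bound(d):
    """The value (3 - 2 sqrt 2) d - (sqrt 2 - 1) quoted in the text."""
    return (3 - 2 * math.sqrt(2)) * d - (math.sqrt(2) - 1)

for d in (1, 2, 3, 10, 100):
    nu = best_integer_nu(d)
    print(d, nu, growth_exponent(d, nu), claimed_lower_bound(d))
```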

Let now $U$ be a uniform pro-$p$-group of rank $d$, i.e. an inverse limit of $d$-generated finite uniform groups $G$. Then we see that for infinitely many $n$ we have $s_n(U) \ge n^{(3 - 2\sqrt{2})d - (\sqrt{2} - 1)}$.

Now $SL_d(\mathbb{Z}_p)$ is known to have a finite index uniform pro-$p$-subgroup of rank $d^2 - 1$ (see [DDMS, Theorem 5.2]). This proves the following

Proposition 6.5. The group $SL_d(\mathbb{Z}_p)$ has subgroup growth of type at least $n^{(3 - 2\sqrt{2})d^2 - 2(2 - \sqrt{2})}$.

B. Klopsch proved [Kl] that if $G$ is a residually finite virtually soluble minimax group of Hirsch length $h(G)$ then its subgroup growth is of type at least $n^{h(G)/7}$. By using the above argument one can improve this to $n^{(3 - 2\sqrt{2})h(G) - (\sqrt{2} - 1)}$.

§7. Counting subgroups of abelian groups 

The aim of this section is to solve a somewhat unusual extremal problem concerning the number of subgroups of abelian groups. The result we prove is the crucial ingredient in obtaining a sharp upper bound for the number of congruence subgroups of $SL(2, \mathbb{Z})$. Actually we prove a slightly more general result which will be used in [LN] to obtain similar bounds for other arithmetic groups.

We will use Propositions 6.2 and 6.3 in conjunction with the following simple (but somewhat technical) observations.

Proposition 7.1. Let $R \ge 1$ and let $C, t \in \mathbb{N}$ be fixed. Consider pairs of sequences $\{\lambda_i\}, \{\nu_i\}$ of nonnegative integers, such that $\lambda_i \le t$ for all $i$ and $\sum_{i \ge 1} (R\lambda_i + \nu_i) \le C$.

Under these conditions the maximal value of the expression $A(\{\lambda\}, \{\nu\}) = \sum_{i \ge 1} \nu_i(\lambda_i - \nu_i)$ can be attained by a pair of sequences $\{\lambda_i\}, \{\nu_i\}$, $i = 1, 2, \dots, r$, such that:

(i) $\lambda_1 \ge \lambda_2 \ge \dots \ge \lambda_r$, $\nu_1 \ge \nu_2 \ge \dots \ge \nu_r \ge 1$, and $\lambda_i \ge \nu_i$ for all $i$,

(ii) $\lambda_1 = \lambda_2 = \dots = \lambda_{r-1} = t$ and

(iii) for some $0 \le b \le r - 1$ we have $\nu_1 = \nu_2 = \dots = \nu_b = \nu_{b+1} + 1 = \dots = \nu_{r-1} + 1$. If $\lambda_r = t$ then also $\nu_r \in \{\nu_1, \nu_1 - 1\}$.

Proof. Suppose the maximum of $A(\{\lambda\}, \{\nu\})$ is attained by a pair $\{\lambda_i\}, \{\nu_i\}$ of sequences of non-negative integers. Deleting pairs with $\nu_j = 0$ does not change the value of $A(\{\lambda\}, \{\nu\})$, hence we can assume that all $\nu_i \ge 1$. If $\lambda_j < \nu_j$ for some $j$, then we can delete $\lambda_j$ and $\nu_j$ from the sequences and in this way the value of $A(\{\lambda\}, \{\nu\})$ increases, a contradiction. Hence we have that $\lambda_i \ge \nu_i$ for all $i$. By relabelling the indices we can further assume that $\nu_1 \ge \nu_2 \ge \dots \ge \nu_r \ge 1$.

Now, if $\pi$ is a permutation of $\{1, 2, \dots, r\}$, it is clear that the maximum of $\sum_i \lambda_{\pi(i)} \nu_i$ (and hence of $A(\{\lambda_{\pi(i)}\}, \{\nu_i\})$) is achieved for permutations $\pi$ such that $\lambda_{\pi(1)} \ge \lambda_{\pi(2)} \ge \dots \ge \lambda_{\pi(r)}$. By the maximality of the pair $\{\lambda_i\}, \{\nu_i\}$ it now follows that $\lambda_1 \ge \lambda_2 \ge \dots \ge \lambda_r$ as well, proving (i). We shall call a pair of sequences $\{\lambda\}, \{\nu\}$ satisfying (i) good.

Let $j$ be the smallest index such that we have $t > \lambda_j \ge \lambda_{j+1} \ge 1$ (if there is no such $j$ then (ii) holds).

Assume that $\lambda_{j+1} = \dots = \lambda_{j+k}$ and $\lambda_{j+k} > \lambda_{j+k+1}$ or $j + k = r$. The condition $\nu_j \ge \nu_{j+k}$ implies that $\nu_j((\lambda_j + 1) - \nu_j) + \nu_{j+k}((\lambda_{j+k} - 1) - \nu_{j+k}) \ge \nu_j(\lambda_j - \nu_j) + \nu_{j+k}(\lambda_{j+k} - \nu_{j+k})$. If $\lambda_{j+k} = \nu_{j+k}$ then (by deleting some terms and relabelling the rest) we can replace our sequences by another good pair for which $\sum \lambda_j$ is strictly smaller and the value of $A(\{\lambda_i\}, \{\nu_i\})$ is the same. Otherwise, replacing $\lambda_j$ by $\lambda_j + 1$ and $\lambda_{j+k}$ by $\lambda_{j+k} - 1$ we obtain a good pair of sequences for which $\{\lambda_i\}$ is lexicographically strictly greater and for which $A(\{\lambda_i\}, \{\nu_i\})$ is at least as large (hence maximal).

It is clear that by repeating these two types of moves we eventually obtain a good pair $\{\lambda_i\}, \{\nu_i\}$ satisfying (ii) as well.

Now set $\beta = \nu_1 + \nu_2 + \dots + \nu_{r-1}$. Then

$$\sum_i \nu_i(\lambda_i - \nu_i) = t\beta - (\nu_1^2 + \dots + \nu_{r-1}^2) + \nu_r(\lambda_r - \nu_r).$$

It is clear that if the value of such an expression is maximal, then the difference of any two of the $\nu_j$ with $j \le r - 1$ is at most 1. Part (iii) follows. □

Proposition 7.2. Let $x_1, x_2, \dots, x_t$ be positive integers such that at most $d$ of the $x_i$ can be equal. Then

$$\prod_{i=1}^{t} x_i \ge \left(\frac{t}{ed}\right)^{t}$$

holds.

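Proof. If, say, $x_t$ is the largest among the $x_i$ then $x_t \ge t/d$. By induction we can assume that $\prod_{i=1}^{t-1} x_i \ge \left(\frac{t-1}{ed}\right)^{t-1}$ holds. Then

$$\prod_{i=1}^{t} x_i \ge \frac{t}{d}\left(\frac{t-1}{ed}\right)^{t-1} = e\left(\frac{t}{ed}\right)^{t}\left(\frac{t-1}{t}\right)^{t-1} = \left(\frac{t}{ed}\right)^{t} \frac{e}{\left(1 + \frac{1}{t-1}\right)^{t-1}} \ge \left(\frac{t}{ed}\right)^{t}$$

as required. □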
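Proposition 7.2 only needs to be checked on the extremal configuration, in which each of the values $1, 2, 3, \dots$ is used $d$ times until $t$ numbers have been chosen. A minimal Python sketch (the function names are ours) comparing logarithms of both sides:

```python
import math

def minimal_product(t, d):
    """Smallest product of t positive integers with each value used at most d times."""
    return math.prod(i // d + 1 for i in range(t))

def bound_holds(t, d):
    """Check log(prod x_i) >= t * log(t / (e d)) for the minimal configuration."""
    return math.log(minimal_product(t, d)) >= t * (math.log(t / d) - 1) - 1e-9

print(minimal_product(5, 2), bound_holds(5, 2))
```

Since every admissible choice of the $x_i$ has product at least `minimal_product(t, d)`, the inequality for this configuration implies it in general.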

The main result of this section is the following. 

Theorem 7.3. Let $R \ge 1$ be a real number and $d$ be a fixed integer $\ge 1$. Let $n, r$ be positive integers. Let $G$ be an abelian group of the form $G = C_{x_1} \times C_{x_2} \times \dots \times C_{x_t}$ where at most $d$ of the $x_i$ can be equal. Suppose that $r|G|^R \le n$ holds. Then the number of subgroups of order $\le r$ in $G$ is at most $n^{(\gamma + o(1))\ell(n)}$ where $\gamma = \frac{(\sqrt{R(R+1)} - R)^2}{4R^2}$. In particular, if $R = 1$ then $\gamma = \frac{3 - 2\sqrt{2}}{4}$.

Proof. We start the proof with several claims. 

Claim 1. $t \le (1 + o(1))\ell(n)$.
Proof. By Proposition 7.2 we have $\left(\frac{t}{ed}\right)^t \le n$. This easily implies the claim.

Claim 2. In proving the theorem, we may assume that $t \ge \gamma\ell(n)$.

Proof. For otherwise, every subgroup of $G$ can be generated by $\gamma\ell(n)$ elements, hence

$$|\mathrm{Sub}(G)| \le |G|^{\gamma\ell(n)} \le n^{\gamma\ell(n)}.$$

Now let $a(n)$ be a monotone increasing function which goes to infinity sufficiently slowly. For example, we may set $a(n) = \log\log\log\log n$.

Let $G_p$ denote the Sylow $p$-subgroup of $G$ and let $\lambda_1^p \ge \lambda_2^p \ge \dots$ denote the layer type of $G_p$. Loosely speaking, we call any layer of some $G_p$ a layer of $G$. We call such a layer essential if its dimension $\lambda_i^p$ is at least $\frac{\ell(n)}{a(n)}$. Clearly the essential layers in $G_p$ correspond to the layers of a certain subgroup $E_p$ of $G_p$ (which equals $\Omega_i(G_p)$ for the largest $i$ such that $\lambda_i^p \ge \frac{\ell(n)}{a(n)}$). Let us call $E = \prod_p E_p$ the essential subgroup of $G$.

Claim 3. Given $E \cap T$ we have at most $n^{o(\ell(n))}$ (i.e., a small number) of choices for a subgroup $T$ of $G$.

Proof. It is clear from the definitions that every subgroup of the quotient groups $G_p/E_p$, and hence of $G/E$, can be generated by less than $\frac{\ell(n)}{a(n)}$ elements. Therefore the same is true for $T/T \cap E$. This implies the claim.

By Claim 3, in proving the theorem, it is sufficient to consider subgroups T of E. 

Let $v$ denote the exponent of $E$. Then $E$ is the subgroup of elements of order dividing $v$ in $G$. Now $v$ is the product of the exponents of the $E_p$, hence the product of the exponents of the essential layers of $G$. It is clear from the definitions that we have $v^{\ell(n)/a(n)} \le n$, hence $v \le (\log n)^{a(n)}$. Using well-known estimates of number theory [Ra] we immediately obtain the following.

Claim 4. (i) The number $z$ of different primes dividing $v$ is at most $\frac{c\log v}{\log\log v} \le \frac{c\,a(n)\log\log n}{\log\log\log n}$.

(ii) The total number of divisors of $v$ is at most $v^{\frac{c}{\log\log v}} \le (\log n)^{\frac{c\,a(n)}{\log\log\log n}}$ for some constant $c > 0$.

Claim 5. $|G : E| \ge (\log n)^{(1 + o(1))t}$.

Proof. Consider the subgroup $E^i = E \cap C_{x_i}$. It follows that $E^i$ is the subgroup of elements of order dividing $v$ in $C_{x_i}$. Set $e_i = |E^i|$ and $h_i = x_i/e_i$. It is easy to see that $E = \prod_{i \ge 1} E^i$, hence $|G : E| = \prod_i h_i$.

By Claim 4(ii), for the number $s$ of different values of the numbers $e_i$ we have $s = (\log n)^{o(1)}$. We put the numbers $x_i$ into $s$ blocks according to the value of $e_i$. By our condition on the $x_i$ it follows that at most $d$ of the numbers $h_i$ corresponding to a given block are equal. Hence altogether at most $ds$ of the $h_i$ can be equal. Using Proposition 7.2 we obtain that $|G : E| \ge \prod_i h_i \ge \left(\frac{t}{eds}\right)^t$.

Since $sd = (\log n)^{o(1)}$ and by Claim 2 $t \ge \gamma\frac{\log n}{\log\log n}$, we obtain that $|G : E| \ge (\log n)^{(1 + o(1))t}$ as required.

Let us now choose a group $G$ and a number $r$ as in the theorem for which the number of subgroups $T \le E$ of order dividing $r$ is maximal. To complete the proof it is clearly sufficient to show that this number is at most $n^{(\gamma + o(1))\ell(n)}$.

Denote the order of the corresponding essential subgroup $E$ by $f$ and the index $|G : E|$ by $m$.

Using Propositions 6.2 and 6.3 we see that apart from an $n^{o(\ell(n))}$ factor (which we ignore) the number of subgroups $T$ as above is at most

$$\prod_p \prod_{i \ge 1} p^{\nu_i^p(\lambda_i^p - \nu_i^p)} \tag{7.1}$$

for some $\nu_i^p, \lambda_i^p$, where $\{\lambda_i^p\}, \{\nu_i^p\}$ is a pair of sequences for every $p$, $\prod_p \prod_{i \ge 1} p^{\lambda_i^p}$ divides $f$ and $\prod_p \prod_{i \ge 1} p^{\nu_i^p}$ divides $r$. Assuming that $f$ and $r$ are fixed together with the upper bound $t$ for all the $\lambda_i^p, \nu_i^p$, let us estimate the value of the expression (7.1).

By Proposition 7.1 a maximal value of an expression like (7.1) is attained for a choice of the $\lambda_i^p, \nu_i^p$ (for the sake of simplicity we use the same notation for the new sequences) such that for every $p$ there are at most 3 different pairs $(p^{\lambda_i^p}, p^{\nu_i^p})$, equal to, say,

$$(p^t, p^{\mu^p + 1}), \quad (p^t, p^{\mu^p}), \quad \text{and} \quad (p^{\tau^p}, p^{\mu_0^p})$$

where $\mu_0^p \le \tau^p \le t$ and $\mu^p < t$ for all $p$.

Exchange the pairs equal to the first type for pairs equal to $(p^t, p^{\mu^p})$. We obtain an expression like (7.1) such that the ratio of the two expressions is at most



p i>l 

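If now there are, say, $\alpha^p$ pairs with $(p^{\lambda_i^p}, p^{\nu_i^p})$ equal to $(p^t, p^{\mu^p})$ then take $\beta^p$ to be the largest integer with $2^{\beta^p} \le p^{\alpha^p}$ and set $\beta_0^p = [\log_2 p]$. (Note that for every $p$ there is at most one pair of the form $(p^{\tau^p}, p^{\mu_0^p})$.)

Consider the expression

$$\prod_p 2^{\beta^p \mu^p (t - \mu^p)} \, 2^{\beta_0^p \mu_0^p (\tau^p - \mu_0^p)}. \tag{7.2}$$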

Its value may be less than that of (7.1) but in this case their ratio is bounded by (2^^)* n 
(where z is the number of primes dividing v). Hence this ratio is at most 

To prove our theorem it is sufficient to bound the value of (7.2) by $n^{(\gamma + o(1))\ell(n)}$.

It is clear that the value of (7.2) is equal to the value of another expression

$$\prod_{k \ge 1} 2^{\nu_k(\lambda_k - \nu_k)} \tag{7.3}$$

for appropriate sequences $\{\lambda_k\}, \{\nu_k\}$ which both have $\sum_p (\beta^p + \beta_0^p)$ terms and for which $\lambda_k, \nu_k \le t$ and also

$$\prod_{k \ge 1} 2^{R\lambda_k + \nu_k} \le f^R \cdot r, \quad \text{i.e.} \quad \sum_{k \ge 1} (R\lambda_k + \nu_k) \le \log(r f^R). \tag{$*$}$$

More precisely, the sequence $\{\lambda_k\}$ has $\sum_p \beta^p$ terms equal to $t$ and $\beta_0^p$ terms equal to $\tau^p$ for every $p$, while $\{\nu_k\}$ consists of $\mu^p$ repeated $\beta^p$ times and $\mu_0^p$ repeated $\beta_0^p$ times each (in the appropriate order).

By Proposition 7.1 the expression (7.3) attains its maximal value for some sequences $\{\lambda_k\}, \{\nu_k\}$ such that all but one of the $\lambda_k$, say $\lambda_{a+1}$, are equal to $t$ and we have $\nu_1 = \nu_2 = \dots = \nu_b = 1 + \nu_{b+1} = \dots = 1 + \nu_a$ for some $b \le a$.

Consider now the expression

$$\prod_{k \ge 1} 2^{\nu'_k(\lambda'_k - \nu'_k)} \tag{7.4}$$

where

$$t = \lambda'_1 = \dots = \lambda'_a \quad (\lambda'_{a+1} = 0)$$

and

$$\nu_a = \nu'_1 = \nu'_2 = \dots = \nu'_a \quad (\nu'_{a+1} = 0).$$

It easily follows that the value of (7.3) is at most $2^{2t^2}$ times as large as the value of (7.4), and $2^{2t^2} = n^{o(\ell(n))}$. Hence it suffices to bound the value of (7.4) by $n^{(\gamma + o(1))\ell(n)}$.

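To obtain our final estimate denote $2^a$ by $y$, $m^{1/t}$ by $w$ (where $m = |G : E|$) and set $x = y \cdot w$.

For some constants $\sigma, \rho$ between 0 and 1 we have $y = x^{\rho}$ and $\nu'_1 = \sigma t$. Then

$$w = x^{1-\rho} = y^{\frac{1-\rho}{\rho}}.$$

Note that the condition ($*$) implies $2^{a(\sigma t + Rt)} = y^{\sigma t} y^{Rt} \le r f^R$. We have $n \ge r(mf)^R \ge y^{\sigma t} \cdot y^{Rt} \cdot m^R$, hence

$$\log n \ge t \cdot \log y \left(R + \sigma + R\,\frac{1-\rho}{\rho}\right).$$

By Claim 5 we have $w \ge (\log n)^{1 + o(1)}$. Hence

$$(1 + o(1)) \log\log n \le \log w = \frac{1-\rho}{\rho} \log y.$$

Therefore

$$\frac{(\log n)^2}{\log\log n} \ge (1 + o(1)) \frac{t^2 (\log y)^2 \left(R + \sigma + R\frac{1-\rho}{\rho}\right)^2}{\frac{1-\rho}{\rho} \log y} = (1 + o(1)) \, t^2 \log y \left(R + \sigma + R\,\frac{1-\rho}{\rho}\right)^2 \frac{\rho}{1-\rho}.$$

The value of (7.4) is $y^{\sigma t(t - \sigma t)}$, which as we saw is an upper bound for the number of subgroups $T$ (ignoring an $n^{o(\ell(n))}$ factor). Hence

$$\frac{\log(\text{number of subgroups } T)}{\frac{(\log n)^2}{\log\log n}} \le (1 + o(1)) \frac{\sigma(1-\sigma)\, t^2 \log y}{t^2 \log y \left(R + \sigma + R\frac{1-\rho}{\rho}\right)^2 \frac{\rho}{1-\rho}} = (1 + o(1)) \frac{\sigma(1-\sigma)\rho(1-\rho)}{(\rho(R + \sigma) + R(1-\rho))^2} = (1 + o(1)) \frac{\sigma(1-\sigma)\rho(1-\rho)}{(R + \sigma\rho)^2}.$$

As observed in §3, the maximum value of $\frac{\sigma(1-\sigma)\rho(1-\rho)}{(R + \sigma\rho)^2}$ for $\sigma, \rho \in (0, 1)$ is $\gamma$. The proof of the theorem is complete. □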
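The final maximisation can be checked numerically. The Python sketch below rests on two assumptions made explicit here: that the quantity to be maximised is $\sigma(1-\sigma)\rho(1-\rho)/(R+\sigma\rho)^2$, and that $\gamma = (\sqrt{R(R+1)} - R)^2/(4R^2)$, which for $R = 1$ gives the value $(3 - 2\sqrt{2})/4$ quoted in Theorem 7.3. It evaluates the expression on a grid in $(0,1)^2$ and compares the grid maximum with $\gamma$; the function names are our own.

```python
import math

def gamma(R):
    """Assumed closed form gamma = (sqrt(R(R+1)) - R)^2 / (4 R^2)."""
    return (math.sqrt(R * (R + 1)) - R) ** 2 / (4 * R ** 2)

def objective(sigma, rho, R):
    """sigma (1 - sigma) rho (1 - rho) / (R + sigma rho)^2."""
    return sigma * (1 - sigma) * rho * (1 - rho) / (R + sigma * rho) ** 2

def grid_max(R, steps=400):
    """Maximum of the objective over an evenly spaced grid in the open unit square."""
    pts = [k / steps for k in range(1, steps)]
    return max(objective(s, r, R) for s in pts for r in pts)

for R in (1, 2, 5):
    print(R, grid_max(R), gamma(R))
```

On the diagonal $\sigma = \rho$ the maximum is attained at $\sigma = \sqrt{R(R+1)} - R$, which is consistent with the grid search.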

By using a similar but simpler argument, one can also show the following 

Proposition 7.4. Let $G$ be an abelian group of order $n$ of the form $G = C_{x_1} \times C_{x_2} \times \dots \times C_{x_t}$ where $x_1 > x_2 > \dots > x_t$. Then $|\mathrm{Sub}(G)| \le n^{(\frac{1}{16} + o(1))\ell(n)}$. This bound is attained if $x_i = t \cdot i$ for all $i$.

Combining this result with an earlier remark, we obtain that $n^{(\frac{1}{4} + o(1))\ell(n)}$ is the maximal value of $\prod_{i,j} \gcd(x_i, x_j)$ where the $x_i$ are different numbers whose product is at most $n$.

Note that $|\mathrm{Sub}(G)|$ is essentially the number of subgroups $T$ of order $[\sqrt{n}]$ (see [Bu]
for a strong version of this assertion). Hence Proposition 7.4 corresponds to the case 
R= 1, r ~ n^''^ of Theorem 7.3. 

§8. End of proofs of Theorems 2, 3, and 4. 

Theorem 2 is actually proved now: the lower bound was shown as a special case of $R = R(G) = 1$ in §3. For the upper bound, we have shown in Proposition 5.7 that $\alpha_+(SL_2(\mathbb{Z}))$ is equal to $\limsup \frac{\log f(n)}{\lambda(n)}$ (see the discussion preceding Proposition 5.7 for the definition of $f(n)$). But Theorem 7.3 implies, in particular, that $f(n)$ is at most $n^{(\gamma + o(1))\ell(n)}$ where $\gamma = \frac{3 - 2\sqrt{2}}{4}$. This proves that $\alpha_+(SL_2(\mathbb{Z})) \le \gamma$ and finishes the proof.

The proof of Theorem 3 is similar, but several remarks should be made: The lower bound was deduced in §4. For the upper bound, one should follow the reductions made in §5. The proof can be carried out in a similar way for $SL_2(\mathcal{O})$ instead of $SL_2(\mathbb{Z})$ but the following points require careful consideration.

1) One can pass to the case that $m$ is an ideal which is a product of different primes $\pi_i$ in $\mathcal{O}$, but it is possible that $\mathcal{O}/\pi_i$ is isomorphic to $\mathcal{O}/\pi_j$. Still, each such isomorphism class of quotient fields can occur at most $d$ times, where $d = [k : \mathbb{Q}]$.




2) The maximal subgroups of $SL_2(\mathbb{F}_q)$, when $\mathbb{F}_q$ is a finite field of order $q$ ($q$ is a prime power, not necessarily a prime), are the same $B$, $D$ and $A$ as described in (1), (2), and (3) of §5.

The rest of the reduction can be carried out in a similar way to §5. The final outcome is not exactly as $f(n)$ at the end of §5, but can be reduced to a similar problem where $\tilde{f}(n)$ counts $s_r(X)$ when $X$ is a product of abelian cyclic groups, with a bounded multiplicity. Theorem 7.3 covers also this case and gives a bound for $\tilde{f}(n)$ which is the same as for $f(n)$. Thus $\alpha_+(SL_2(\mathcal{O})) \le \gamma = \frac{3 - 2\sqrt{2}}{4}$.

We finally mention the easy fact that replacing $\mathcal{O}$ by $\mathcal{O}_S$, when $S$ is a finite set of primes (see the introduction), does not change $\alpha_+$ or $\alpha_-$. To see this one can use the fact that for every completion at a simple prime $\pi$ of $\mathcal{O}$, $G(\mathcal{O}_\pi)$ has polynomial subgroup growth, and then use the well known techniques of subgroup growth and the fact that

$$G(\hat{\mathcal{O}}) = G(\hat{\mathcal{O}}_S) \times \prod_{\pi \in S \setminus V_\infty} G(\mathcal{O}_\pi)$$

to deduce that $\alpha(G(\mathcal{O})) = \alpha(G(\mathcal{O}_S))$.

Another way to see it is to observe that $G(\hat{\mathcal{O}}_S)$ is a quotient of $G(\hat{\mathcal{O}})$ and, hence, $\alpha_+(G(\mathcal{O})) \ge \alpha_+(G(\mathcal{O}_S))$. On the other hand, the proof of the lower bound for $\alpha(G(\mathcal{O}))$ clearly works for $G(\mathcal{O}_S)$. Theorem 3 is, therefore, now proved, as well as Theorem 4 (since we have not used the GRH for the upper bounds in Theorem 3).

§9. An extremal problem in elementary number theory. 

The counting techniques in this paper can be applied to solve the following extremal 
problem in multiplicative number theory. 
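For $n \to \infty$, let

$$M_1(n) = \max\Big\{ \prod_{1 \le i,j \le t} \gcd(a_i, a_j) \;\Big|\; 0 < t,\ a_1 < a_2 < \dots < a_t \in \mathbb{Z},\ \prod_{i=1}^{t} a_i \le n \Big\},$$

$$M_2(n) = \max\Big\{ \prod_{p, p' \in \mathcal{P}} \gcd(p - 1, p' - 1) \;\Big|\; \mathcal{P} = \text{set of distinct primes where } \prod_{p \in \mathcal{P}} p \le n \Big\}.$$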

We shall prove the following theorem which can be considered as a baby version of Theorem 
2 (compare also to Theorem 7.3 ). Note that Theorem 9.1 immediately implies Theorem 9. 

Theorem 9.1. Let $\lambda(n) = \frac{(\log n)^2}{\log\log n}$. Then

$$\lim_{n \to \infty} \frac{\log M_1(n)}{\lambda(n)} = \lim_{n \to \infty} \frac{\log M_2(n)}{\lambda(n)} = \frac{1}{4}.$$

Proof. Recall that if $a_1, a_2, \dots, a_t \in \mathbb{Z}$ and $G = C_{a_1} \times C_{a_2} \times \dots \times C_{a_t}$ is a direct product of cyclic groups then by §6,

$$|G|^{-1} |\mathrm{End}(G)|^{1/4} \le |\mathrm{Sub}(G)| \le |G|^{2} |\mathrm{End}(G)|^{1/4}$$

and

$$|\mathrm{End}(G)| = \prod_{1 \le i,j \le t} \gcd(a_i, a_j).$$

Proposition 7.4 implies that

$$\limsup_{n \to \infty} \frac{\log M_1(n)}{\lambda(n)} \le \frac{1}{4}.$$

It is clear that $M_2(n) \le M_1(n)$, so to finish the proof it is enough to obtain a lower bound for $M_2(n)$.

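Now, for $x \to \infty$ and $\frac{x^{\rho}}{\log x} \le q \le x^{\rho}$ (with $0 < \rho < \frac{1}{2}$) choose

$$\mathcal{P} = \mathcal{P}(x, q) = \{p \le x \mid p \equiv 1 \pmod{q}\}$$

to be a Bombieri set relative to $x$, where $q$ is a prime number (Bombieri prime). By Lemma 2.4 we have the asymptotic relation $\#\mathcal{P}(x, q) \sim \frac{x}{\varphi(q)\log x}$. In order to satisfy the condition $\prod_{p \in \mathcal{P}} p \le n$, we choose $x \sim q \log n$. Without loss of generality, we may choose $q = x^{\rho}$ for some $0 < \rho < \frac{1}{2}$. It follows that

$$x^{1-\rho} \sim \log n, \qquad \log x \sim \frac{\log\log n}{1 - \rho}, \qquad \#\mathcal{P} = \#\mathcal{P}(x, q) \sim \frac{x}{\varphi(q)\log x} \sim \frac{(1 - \rho)\log n}{\log\log n}.$$

Consequently

$$\prod_{p, p' \in \mathcal{P}} \gcd(p - 1, p' - 1) \ge q^{(\#\mathcal{P})^2} \ge (x^{\rho})^{\left(\frac{(1-\rho)\log n}{\log\log n}\right)^2} \sim e^{\rho(1-\rho)\frac{(\log n)^2}{\log\log n}}.$$

Let now $\rho$ go to $\frac{1}{2}$ and the theorem is proved. □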
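The key inequality $\prod \gcd(p-1, p'-1) \ge q^{(\#\mathcal{P})^2}$ holds because $q$ divides $p - 1$ for every $p \in \mathcal{P}(x, q)$. The Python sketch below illustrates it for small parameters; the values $x = 200$, $q = 5$ and the function names are arbitrary illustrative choices of ours, and no attempt is made to select a genuine Bombieri prime.

```python
from math import gcd

def primes_up_to(x):
    """Sieve of Eratosthenes returning all primes <= x (x >= 1)."""
    sieve = [True] * (x + 1)
    sieve[0:2] = [False, False]
    for i in range(2, int(x ** 0.5) + 1):
        if sieve[i]:
            sieve[i * i :: i] = [False] * len(range(i * i, x + 1, i))
    return [i for i, is_p in enumerate(sieve) if is_p]

def progression_primes(x, q):
    """The set P(x, q) of primes p <= x with p = 1 (mod q)."""
    return [p for p in primes_up_to(x) if p % q == 1]

def gcd_product(primes):
    """Product of gcd(p - 1, p' - 1) over all ordered pairs (p, p')."""
    result = 1
    for p in primes:
        for p2 in primes:
            result *= gcd(p - 1, p2 - 1)
    return result

P = progression_primes(200, 5)
lower = 5 ** (len(P) ** 2)
print(P)
print(gcd_product(P) >= lower)
```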